SEQ ID NO:

Predicted beginning nucleotide location corresponding to first amino acid residue of amino acid sequence

Predicted end nucleotide location corresponding to first amino acid residue of amino acid sequence

Amino acid segment containing signal peptide
(A=Alanine, C=Cysteine, D=Aspartic Acid, E=Glutamic Acid, F=Phenylalanine, G=Glycine,
H=Histidine, I=Isoleucine, K=Lysine, L=Leucine, M=Methionine, N=Asparagine,
P=Proline, Q=Glutamine, R=Arginine, S=Serine, T=Threonine, V=Valine,
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop Codon, /=possible nucleotide deletion,
\=possible nucleotide insertion)


6675 


277 


1678 


GNWPTERMAFLDNPTI I IiAHI RQ S H VTSDDTGMCEMVL I DHDVD 
LEKIHPPSMPGDSGSEIQGSNGETQGYVYAQSVDITSSWDFGIR 
RRSNTA0RLER1RKERQWQIKCKNIQWKERNSKQSACELKSLFE 
KKSLKEKPP 1 F G KQS I LSVRLEQCPLQLNNPFNEYS KFDGKGHV 
GTTATKKIDVYLPLHSS0DRLLPMTWTMAS7u^VODtIGLXCWQ 
YTSEGREPKLKD:>IV£AyCLHIAEDDGEVDTDFPPbDSKEPIHKF 
GFSTLALVEKYS3PGLTSKESLFVRINAAKGFSLIQVDNTKVTM 
KEILLKAVKRaKGSCKVSGSRADGVFEEDSQIDIATVODMLSSH 
HYKS FKVSM1 KR LRFTTDVQL/ GCALFPGVLR KRAAP VDCLR PS 
ADTWRQEO 1 GCCGAACAAIiRS * DSHKC* EG 1 SGDKVE1D PVTNQ 
KASTKFWIKQKPI SIDSDI-LCACNDLAEE 


6676 


277 


1676 


GKWPTERKAFLDNPTIILAHIRQSHVTSDDTGMCEMVL1DHDVD 
LE K I H PPSM PC P3G S E 1 QG SNGETQG YVY AQS VD I TS S W DFG 1 R 
RRSN TAQR LE R 1 /RKER QN Q I KC KN I Q WKEKNS KQSAQE LKSL EE 
KKSLKEKPPISGKOSILSVRLEOCPLQLNKPFNEYSKFDGKGHV 
GTTATKKIDVVLPLHSSQDRIjLPMTVVTMASARVODLIGLI cwq 
YTSEGREPKLKDNVSAYCLHIAEDDGEVDTDFPPLDSNEP1HKF 
GFSTLALVEKY5\8PGLTSKESLFVRINAAKGFSLIQVDNTKVTM 
KEILLKAVKRRKGSCKVSGSRADGVFEEDSQIDIATVODMLSSH 
HYKSFKVSMJ HR IjRFTTDVQL/GCALFPGVLR KRAAP VDCLR PS 
ADTVJ R QEQ I G C CG AAC AALRS * DS HKC * EG I S GDKVE I D P VTNQ 
KASTKFWIKQKFISIDSDLLCAC\DLAEE 


6677 


277 


I67e 


GNWFTERMAFLDNPT3 1 LAHIRQSHVTSDDTGMCE.WL3 DHDVD 
LEK1KPPSMPGDSGSEIOGSNGETQGYVYAOSVDITSSWDFGIR 
RRSNTAQRLEKLRKERQNQIKCKN1QWKERNSKQSAOELKSLFE 
KKSLKEKPP 1 ?G KQSI LS VRLEOCPLQLNNPFNEYSKFDGKGHV 
GTTATKKID\^L?LHSS0DRLLPMTWTMA£ARVQDL1GLICW0 
YTSEGREPKLKrNVSAYCLHIAEDDGEVDTDFPPLDSNEPIHKF 
GFSTLALVEKY f S PGLTS KESLFVRINAAHGFSLI QVDNTKVTK 
KE I LLKAVKRK KGS QKVSGS RADGVFEEDSQ 1 D I ATVQDMLS SH 
HYKS F KVSMI HR LR FTTDVQL/ GCALFPGVLRKRAAP VDCLRP S 
ADTWRQEQI GCCG AACAALRS * DSHKC* EG I SGDKVE1DPVTNQ 
KASTKFWIKQKP I SIL>SDLLCAC\DLAEE 


6678


221 


665 


GPSNQSSGSLSL1 VTGCSSYWS*INDTCTILRVLSSN T FGRQ*LR 
PFPCSQLPMSOG CLWHLDCCCPWVPY I PGOOWRKGRQRMRN* OS 
LLGSDQBSVGLi: DLCVFVNFLLHVLLGLFP* PHELFLLPWDLG 
FLFPLLLQGGCK CLVLPANLVSQAPQ IGKLSCRLQTKDLEGSRN 
HHPLFLWGRVilAVKHLETVQSGLASLGFVGQHTSHGPP 


6679 


2 


786 


LEFARGAMFFLGODWRSPGQNWVKTVDGWKRFLDEKSGSFVSDL 
S SY CN KE VYN K EK LFNS LN YD/S CSQEEKEGKAE * QNONS \ D FH 
OEKK3 YVHKGS Z KERHGYCTLGEAFNRLDFSTAILDSRRFWYVV 
RLLEL I AKSQL7 £ LSGI AQKNFMN 3 LE KWLKVLEDQON 1 TLI R 
ELLC7LYTSLC7LVKRVGXSVLVGNINMWVYRMETILHWQQ0LN 
N1Q3TRVSG0A0PPPGSGSLHRDTGQTR0DFEFTPVTEESGLF 


6680 


1498 


2551 


PLCTLPLMPSAL-yGWAGERWEKQWPLA/PGPGTWQTPVGSlSEE 
P\RKNEPDTHC V R GEAR PEV * HLPKPHS PGSEGAE I QTSA* AL? 
/NQVS PPQPM * GAEENGDQRGGXEEAGEELHRSSSGLTAAPGF? 
L. VHRN LfjTr PG Xj P S KGGG P / GGAGTQGb WAF^ b>\} V V / s, Fli.br AS 
MQRSQAGLPGWLAGLVESPTHHIPALRPSGWATGEAFPSTTCS 
SGP\ PAPPGPTGLRPGGGSSSGGHG* * PGLPVGK V\ GALGAAQD 
PQSQG RG PTQG T V GTEMLLSGLGS AKAC P AAR P AV P * LPS DPAS 
T I P K KGTRG FGEG PGVLOERNRWWGRAQGFTS ADAAGTAPPGV 
♦LPAPI^QPPGATEPQVRACGMAPPSPGTSGRLVANGRHPGPQV 
AOGC PPGAGCVvG S QPRGSORCPRTYTHSPLGHGRAPCPRR CWH* 
WQDP PSSPRTGCLPGI PARQAYSAPRTRSRPG I RTGRAAYGFIR 
FQGGGGG 



WO 01/53312 



PCT/US00/34263



6681 


1169 


511 


1 N Y I Y YNQOQRAFH £ LK\ EKLMS APALGL PDLTK L FTLH VS ERE 
fiMTVGVLTOTVGPKSRPGAYLSKQLDGVSKGKPPCPRALAATAL 
LAOEADELTLRONLI"JRKSPHA\WTLINTKGHH*L1NARLTRYQ 
TLLCENPHKTIEVSNT/LNPATLLLVTESPVKHNCLEVLDSVYS 
S RPNLRDHP * TS VDWELYVDGSGFANPCKVTLKKETS PAPVTPR 


6682 


10S 


1238 


TVLCGAMQVSSLNEVKIYSLSCGKSLPEWLSDRKKRALOKKDVD 
VRRRIELI ODFEMPT VCTT I KVS KDGQ Y I LATG TYK PR VRCYDT 
YQLSLKFERCLDSEWTFEILSDDYSKIVFLHNDRYIEFHSQSG 
FYYKTR1PK FGRD FS YH Y PSCDLY FVGASS E VYRLNLE OGR YLN 
PLOTDAAENNVCDINSVHGLPATGTIEGRVECWDPRTRNRVGLL 
D \ AP * TVS QO I OR * T S LPTI S ALKFN\G ALTMA VGTTTG QVLL Y 
DLRSDKPLLVKDHOYGLPIKSVHF0DSLDLILSADSR1VKMWNK 
NSGKI FTSLEPEHDLNDVCIiYPNSGMLLTANETPKMGI Y YI PVL 
GPAPRWCSFLDNLTEELEENPESNE 


6683 


10? 


1236 


TVLCG AMQVSS LNEVK I YS LSCGKS LPEWLSDRKKRALOKKDVD 
V^RRIELI0DF2MPT\ f CTTIKVSKDGQYlLATGTYKPRVRCYDT 
YOLSLKFERCLDSEWTFEILSDDYSKIVFLKNDRYIEFHSQSG 
FYYKTRIPKFGRDFSYHYPSCDLYFVGASSEVYRLNLEOGRYLN 
PLQTDAAENNVCDINSVHGLFATGTIEGRVECWDPRTRNRVGLL 
D \ AP * TVS 00 1 OR * T S1»PT I SALKFN \ G ALTMAVGTTTGQ VLLY 
DLRSDKPLLVKDHOYGLPIKSVHFODSIjDLILSADSRIVKMWNK 
NSGKIFTSLEPEHDLNDVCLYPNSGMLLTANETPKMGIYYIPVL 
GPAPRWCSFLDNLTEELEENPESNE 


6684


113 


527


GLRGGTSRGRAGREF E FAAGVLCWAGFCOS PCPPGGPGREAPA 
PP \SGRRKA* RPA+ W LGG PGGDSGGR E EGGS /GELORAMESKMG 
ELPLDINlOEPRWDOSTFliGRARHFFTVTEPRNLLLSGAOLEAS 
RN3V0NYR 


6685




1473 


KLLGDNFEGFCNKFELSDSENGSNS*QSPb\FDRLFDPDPQKVL 
0GV1DMKNAVIGNNKOKANLIVLGAVPRLLYLLO0ETSSTELKT 
ECAWLGSLAMGTENNVKSLLDCHI I P ALLOGLLS PDLKF I EAC 
LRCLRTIFTSPVTPEELLYTDATVIPHLMALLSRSRYTOEYICQ 
1 FSHCCKGPDHOTI LFNHGAVQNI AHLLTSLSYKVRMQALKCFS 
VLAFEN PQVSMTLVNVLVDGELLPQl FVKMLQRDKP I EMOLTSA 
KCLTYMCRAGAIRTDDNCIVLKTLPCLVRMCSKERLLEERVEGA 
ETLAYLI EPDVELOR I AS I TDHLI AMLADYFK Y PSS VSA1 TDIK 
RLDHDLKHAHELROAAFKLYASLGANDED1 RKKVSLGEGR PP VL 
TASRQGVTST 


6686 


31C 


927 


DSVTFDDLAVDFTPKEWTLLDPTORNLYRDVMLENYKNLATVGY 
0LFKPSLISWLEQEESRTV0RGDFQASEWKVOLKTKELALQQDV 
LGEPTS S G I OM 3 GS HNGGE VSDVKOCGDVSSEKS CLKTHVRTQN 
SENTFECYLYGVDFLTLKKKTSTGEORSVFSHVWKKPSSLNPDV 
VCOKNRCTRKKKAF* LQLTLGKSFH* SIHT 


6687 


18: 


915 


EAMLEAPYKKEEDEQORKEVKKDYPSNTTSSTSNSGNETSGSST 
IGETSNRSRDRDRYRRRNSRSRSPGROCRHRSRSWDRRHGSESR 
SRDHRREDRVHYRS P PLATGEPVDNLS PEERDARTVFCMOLAAR 
IRPRDLEDFFSAVGKVRDVRIISDRNSRRSKGIAYVEFCEIQSV 

PT.ATfiT.TYWl?! T/3VPT TVDa < ?naPTfNT?Ti2L&M&T\TNT,OKRM^nPMR 
run 1 O-u l <2\/i\l-tLAj\ i 1 J. VvnOynDRlWUmWU'llVLy^WUCrrJR 

LYVGS LH FN 3 TEDMLRG I FEPFGKV 


6688 


102i 


1 


AEVPNYPRVFHXCPDSCWRFKFQPIQLOPYILLSFSSEKPPISF 
SEPGLPR / SATARMATAAAPPNSSIDLPSDSGMGFISPAGDSLD 
LPSDGGTGFFSLAGDSSSTRLSSLAFISFSLSSVSVGSSAGTTS 
STSVGSWAAFTSSSSSSTNRDVAGLDFSTVITSVSGSLVPSRE 
VAVICGSKGAGASGSASCSSRAGKTTEATAASSMPSGTSSFSTC 
TMSELEELFSLPSPAPLLSXbPTSSGS I AI CCQDSGPSDTGRLS 
VCOLWLADSDTGKLSDCOEWTVGDSGGLTCPELSLGRM*MSLL 
SSAVIFGYSSSSDSRLNTVPTVDLLCPFQTKSST 


6689 


640


1299 


AEDSTFRICSPSVSDTSSDSSGSKDNVLILFSKVSI+SCFSLSS 
FFSDS1 SFCFSSSSFCKR* FVSSKVSQNALLSSKLSNGPGGSSK 
CRNSLTARQLAMSL* ATKF * RNACNPNCLSSKKSAL* LSLNQR F 
GGSASRKPGNISFKSQKCSALSYCCNFVIKPREVSVSSENYPAF 


6690 




442 


GTRGKHAATLGPIjGSWQOWRRCLSARDGSRMLLLLLLLGSGOGP 
0OVGAGOTFEYLKREHSLSKPYQGVGTGSSSLWNLN5GNAMVMTQ 
Y 3 RLTPDMQS KQGALWNRVPCFLRDWELQVK FXI HGOGKKNlAH 
GDGLAI WYTKDRMQP 


6691


2E7 


1403 


LKTET$ E EKARR YKDRPSQLNAVFQEQKKH3 QhQES I TLEDVAV 
DFTWEEWQLLGAAOKDLYRDVMLEITYSNLVAVGYOASKPDALFK 
LEQGEOLWTI EDGIKSGACSDI WKVDHVLERLOSES.LVNRRKPC 
KEHDAFEN3 VHCSKSOFLLGONHDI FDLRGKSLKSNLTLVNQSK 
GYEIKKSVEFTGNGDSFLHANHERLHTAIKFPASOKLISTKSQF 
1SPKHQKTRKLEKHKVCSECGKAF1KKSWLTDHQVMHTGEKPHR 
CSLCEKAFSRKFMLTEHORTHTGEKPYECPECGKAFLKKSRLNI 
HQKTHTGEKPY I CSECGKGFIQKGNLI VHQR 3HTGEKPY I CNEC 
/GKGF1CKTCLIAHQRF1ITER 


6692 


17£ 


939 


W3 KEGELSLWERFCANI IKAGPKPXHIAFIMIXSNRRYAkKCQVE 
RQEGHS0GFNKLAETLRWCLNLG3 LEVTVYAFS I ENFKRSKSEV 
DGLMDbA\ROKFSRLMEEKEKLQKHGVCIRVLGDLHLLPLDLQEL 
1 AOA VC ATKN YNKCFLNVCFAY TSR HEISNA VREMAWG VEQG LL 
DPSDISESLLDKCLYTNRSPHPD1LIRTSGEVRLSDFLLWQTSH 
SCLVFCPVLWPEYTFWNLPEA3L0F0MNHSVLQK 


6693 


17f 


939 


WJ KEGELSLWERFCANI I KAGPKPXKIAFIMDGNRR YAKKCQVE 
R0EGHSQGFNKLAETLRMCLNLG1 LEVTVYAFS I ENFKRSKSEV 
IXjLMDLAROKFSRLMEEKEKLOKKGVCIRVLGDLHLLPLDLQEL 
IA0AV0ATKNYNKCFLNVCFAY'J\SKHEISNA\'RE:'1AWGVEQGLL 
DPSDI SESLLDKCLYTNRSPHPD] LI RTSGEVRLSDFLLWQTSH 
bt-iiV i? V LWPE i TF WNJjF C.AI LOFQMNHis VliOK 


6694 


292 


813 


SLLLHl^PPGAYTPSQPLSSVSTETASSVRRgAAESROHELPVR 
EVHSLGQ1 LPQDGLTAEAGPPEAC'DP WGS PG 3 SLPAAH I GFAAA 
LAVGPS GCHTE P \ FDE WJ P SL FLGDA Y AARDKS KL1 QLGITHW 
NAAAG KFQ VDTGAKFYRGMSLE Y YG I EADDNPFF0LS VYFLP 


6695 


292 


813 


SLLLHLAPPGAYTPSOPLSSVSTETASSVRROAAESROKELPVR 

LAVGPS GCHTEP \FDEVWPSLFLGDA YAARDKS KLI OLGITHW 
NAAAG KFQVDTGAKFYRGMSLEYYG I EADDNPFFDLS VYFLP 


6696 


j 


782 


PRVRGRVGERWAFLSVPAAMSSEKEPLLLAWSYFRRRKFOLCAD 
LCTQM LEK S PYDQAAWI LKARALTEM VY IDE 3 DVDOEG IAEMML 
DENAIA0VPRPG7SLKLPGTN0TGGFSQAVRP3T0AGRP3TGFL 

TS PDG P F I NLSRLNLTKYS QKP KLAKAL t E Y 3 FHH EN D VKTALD 
lAALSTEHSOYKDWWWK/DQISKCYYRVGMYREAEKOlKSS 


6697 


3 


782 


PPLFLRRLNSRALRPGSRKVMAVv/FASLSGODVGSFAYLTIKDR 
IPQILTKVIDTLHRHKSEFFEKKGEEGVEAEKKAISLLSKLRNE 
LOTDKPFI PLVEKFVDTDI WNOYLEYCX)SLLNESDGKSRWF YSP 
WLLV\EC YMYRR I HE AI \ I QS P P I DY FDVFKES KEQN F YG S QE S 
IIALCTHLQC)LIRTIEDLD\ENOLKDEFFKLLQISLWGEISVDL 
SL \ SGG ESS SQNTNVLNSLEDLK PF I LLNDMEHLWS LLSNCK 


6698 




754 


VG SCACAG S CKCKECKCTS CK KS EC RAFP 


6699 


32S 


492 


EGELP / PARRVLPRAMTASAQPRGRRPGVGVGVWTSCKHPRCV 
LLGKRKG S VG AGS FQLPGGHLE FG ETWEECAOR ETW E E AALHLK 
Im^FAS^m^SF3EKE^m^YVTILMKGEVI)VTHDSEPK^n^PEKN 
J ESKRI I YNHAFFFQES KWSGGILQ 


6700 


109b 


1392 


T0CWRSSTPGMRTHFR70P/RLECGQGFSQ0EKGHCMDTNECIQ 
FPFVCPRDKPVCVNTYGSYRCRTKKKCSRGYEPNEDGTACVERT 
LLLGLCNLLGK 


6701 




148S 


AAAGPRTRVRRAAAFEGQPSPSFGLGPTSDKAAAPRTPKRRRLW 
RORQ/HPAMLCYVTRPDAVLMEVEVEAKANGEDCLNQVCRRLGI 
I EVDY FGLQFTGSKGESLWLNLRNR I SQQMDGLAPYRLKLRVKF 
FVEPHL I L0EQTRH I F FLH I KEALLAGHLLCS PEQAVELSALLA 
QTKFGDYNONTAKYNYEELCAKELSSATLNSIVAKHKELEGTSQ 
ASAEYQVLQIVSAMENYGIEWHSVRDSEGQKLLIGVGPEGISIC 
KDDFSPINRIAYPWQN'ATQSGKNVYLTVTKESGNSIVLLFKMI 
STRAAS GLYRAI TETKAFYR CDTVTSAVMMQYS RDLKGKLASLF 
LNENINLGKKYVFDI KRTSKEVYDHARRALYNAGWDLVSRNNQ 
SPSHSPLKSSESSMNCSSCEGLSCOQTRVL0EKLRXLKEAMLCM 
VCCEEE 1NSTFCPCGHTVCCES CAAOLOVGESAAHFCLQPHLSIi 
LLTGSRSOVLAR 


6702 


397 


1S71 


PL1AKFLKI.DLVNVLCLPMEDVFLFYRTCFCSMGLGSSCHLSLPK 
RAEA1,LCSRKATVVRDLVAVRMAEEQEFT0LCKLPAQPSHPHCV 
NNTY RS AQHSQALLRGLLALRDSG1 LFD WLWEGRH I E AHRI L 
LAASCDYFKGMFAGGLKEMEQEEVLIHGVS YNAMCQI LHFI YTS 
ELELSLSNVQETIiVAACOLQI PEI IHFCCDFLMS WVDEENI LDV 
YRLAELFDLSRLTEOLDTYIbKNFVAFSRTDKYRQLPLEKVYSL 
LS S NRLE VS CETE V YEG ALL YHYSLEQVQADQ 1 S L»HE PF KLLET 
VR FPLM EAE VLQRLHD KLDPS PLR DT VASALMY HRNESLQP S LQ 
SPOTELRSDFQCWGFGGIHSTPS\MSSATRPKYLNPLLGEWKH 
FTASLAPRMSNQGIAVLNNFVYLIGGDNNVOGFRTiESRCWRYDP 
RHNRWFQI QSLQQEHADLS VCWGRY I YAVAGR DYHNPLNAVER 
Y DPATNS WAYVAPLKRE VYAHAGATLEG KM Y I TCGRKGR I T 


6703 


4S 


1244 


g vg praaam p lelelc pgr w vggqhpc f i iae1 gonhqgdldva 
krmi rmakecgadcak fokselefkfnr kalsr fyts khswgkt 
ygehkrhlefshdqyrelqryaeevg1fftasgmdemaveflhe 
lnvpffkvgsgdtnnfpylektak/trgwksvlrdvcgvqlnde 
tsswdvlgrvrtsxeic\;lmvlvldysgrpmvissgmqsmdtmkq 

\7~vr\T\7vm WD\TTrr , "CT nrTcavDT riDPrtvrai OUT CFVfilfT.'PPnTD 

igysghetgiaisvaavalgakvlerkitldktwkgsdhsasle 
pgelaelvrsvrlveralgs ptkqllpcemacneklgksvvakv 
xi p5gtiltmdmltvkvgepkgyppedi fnlvgkkvlvtveedd 

TIMSE 


6704 


82 


1007 


TMNTRNRVVNSGI^ASPASRPTRDPODPSGRQGELSPVEDQREG 
LEAAPKGP SRESWHAGORRTSAYTLI APN I NR RNBIQR I AEQE 
LANLEKWXEQNRAKPVHLVPRRLGGSQSETEVRQKQQLQLMQSK 
YK0KLKREESVRIKKEAEEAELQKMKA1QREKSNKLEEKKRLQE 
N-LRREAFREHQQYKTAEFL/ROTEHRIAROKCLSKCCLWPTILN 
MGQKLGLQ\DSLKAEENR KLQXMXDEQHQKS ELLELKRQQQEQB 
RAKIHOTEHRRVNNAFLDRLOGKSQPGGLE0SGGCWNMNSGNSW 
GI 


6705 


2 


786 


RLCRNSARVPCGWSASRSLGEGAGFIGPLRGPHPRAGGTGTSFT 
SYXRXGGIMSTIAAFYGGKSILITVATGFLGKELMEKLFRTSPD 
LKVIYILVRPKAGQTLOKRVFQILDSKLFEKVIEVRPNVHEKIR 
AI YADLNQNDFAISKEDMQELLSCTNI IFHCAATVRFDDTLRHA 
VQLNV TATRQLLLMAS QM PKLE AF I H I ST AYS W CNLKH X DE V I Y 
PCPVEPKKIIDSLEK\LDDAIIDEITPKLIRDWPNIYTYTK 


6706 


130 


531 


FTHSSSSKSQEMLGKLNMLRNDGHFCDITIRVQDKIFRAHKVVL 
AACSDFFRTKLVGQAEDENKNVLDLHHVTVTGFI PLLEYAYTAT 
LSINTENIIDVLAAASYMQMFSVASTCSEFMKSSILWNTPNSQP 
EK 


6707 


2233 


1343 


YWSGIGY ELQHFHWRKFHFEKKGPPSTCQBRLYESRS RWPCIS * 
GMWVGWTAVNGSK*GGOLRCVCVCTSHSSDSTRSSORASKCKS 
FF1LSQ*KT*SSWENVJVFAKYSRIYSYGHSCSKGRGD*DFK*NV 
SQAR*SRFCGLCNPCGHCGLD1MLRGGSSPWTDKKSCVHNNLLC 
NRRVFSLLCEGPGHCYOGAVCKEACAAASPGLDSAAEPHRLCEH 
TD* LPK * GPGY 1 QHFH CDSNI LCI LYN I SFNLFS Y£ F * GVAR YA 
C*RCHWYFEWLLYNHCGDILVACL*RRQL*SSQ 


6708 


115 


1729 


TVGSWSRSGRSPPVGRQLLLTGRGAQAAGSPQGGMALOVELVFT 
GEI IRWHPHRPCKLALGSDGVRVTMESALTARDR VGVQDFVhh 
ENFTS EAAFZ ENLRRR FR ENL I YTY I G PVLVS VN P Y R D 1 .Q I YSR 
OHMERYRGVSFYEEPPHLLAVADTVYRALRTERRDOAVMISVES 
GAGKTDATKRLLQLYAETCPAPQRGGAVRDRLLQSNPVLEAFGN 
AKTIiRNDNSSRFGKYMDVOFDFKGAPVGGKILSYLLEKSRWHQ 
KHGERNFKIFYQLLEGGEEETLRRLGLERNPQSYLYLVKGQCAK 
VSS INDKSDWKWRKALTVI DFTEDEVEDLLSIAASVLHLGNIH 
FAANEESNAQVTTEKQLKYLTRLLSVEGSTLREALTHRKIIAKG 
EELLS PLNLEOAAYARDALAKAVYS RTFTWLVGKIN R S LASKDV 
ES PSWRSTTVLGLLE 1 YGFEVFQHNS FEQFCINYCNE KLQQLFI 
ELTLKSEQEEYEAEG I AWEPVQYFNNKI ICDLVEEKFKGI I \S 1 
LDE\ECXRPGE 


6709 


3 


8S4 


PPHEHLiFPSGERGPFSFLVSRRGLGPGKMGKKGKKEKKGRGAEK 
TAAKMEKKVSKRSRKEEEDLEALIAHFQTLDAKRTOTVELPCPP 
PSPRZjNASLSVKPEKDELILFGGEYFNGQKTFLYNELYVYNIRK 
DTWTKVD1 PSPP PRRCAHOAVWPQGGGQLWVFGGEFASPNGEQ 

fyhykdlwvlhlatktweovkstggpsgrsghrmvavjkrqlilf 
ggfhestrdy i y yndvyafnldtftwsklspsgtgptfrsgcqN 
ipslipraassvyggyskqrvkkdvdkgtrhsdmf 


6710 


1S8 


980 


RHKMTNYRVESS SGRAARKMRLALMGPAFIAAIGY 3 DPGNFATN 
1QAGASFGYQLLWVVWANLMAML3 Q 1 LSAKLG I ATG KNLASQI 
RDHYPRPVVWFYWVQAEIJAMATDLAEFIGAA1GFKLILGVSL1, 
QGAVLTGI ATFLI LMLC-RRGC-XPLEKV I GGLLLFVAAAY I VE1»I 
FSQPNXiAOIrGKGM VI PSL PTSEA VFLAAGVlAGAT I >3 PHVI / YI 
WESSLTQHLHGGS ROORY SA7KWDVA I AMTIAGFVN1A 1 MATAA 
SELNFYGHTGVA 


6711 


3 


347 


VTECKTMTCKMSQLERNI * TMINTLHHYSVKLGHPDTL3HGEPK , 
ELVRTDLHN I LMKENKNDQA I * H I MEDLDTNAHMQ 1 1 FKELIML . 
MAMLTWSYHDNMHDADYGPGQOHRPG 


6712 


118 


578 


PHGQKRTRYPQVRAPG0OP0AQLAMALCLKQVFAKDKTFRPRKR 
FEPGTQRFELYKKAOASLKSGLDLRSWRLPPGENI DDWIAVHV 
VDFFNRINLIYGTMAERCS*TSCPVMAGGPRYEYRW02ERQYRR 
PAKLSAPRYMALLMDWIESLI 


6713 


2485 


3 


QARGSDSEDGEFEIQAEDDARARKLGPGRPLPTFPTSECTSDVE 
PDTREMVRAQNKKKKKSGGF0SMGLSYPVFKG3MKKGYKVPTPI 
QRKTIPVILDGKDWAMARTGSGKTACFLLPMFERLKTHSAQTG 
ARALILSPTRELALQTLKFTKELGKFTGLKTALILGGDRMEDQF 
AALHENPDI I IATPGRLVHVAVEMSLKLQSVEYWFDEADRIjFE 
MGFAEQLQE1 1 ART iPGGKQTVl»FSATLPKLLVEFARAGLTEPVL 

irldvdtklkeolktsfflvredtkaavli^lhnvvrpqdqtv 
vfvatkhhaeylte1xtt0rvscahiysau)ptarkinlakftl 
gkcstlivtdiiaargldiplldnvinysfpakgklflkrvgrva 
ragrsgtayslvapdeipylldlhlflgrsltlarpbkepsgva 
gvzx5mlgrvp0s\^deei)sgl0stleasleij?gi^vad?jaq0q 

YVRS R PA PS P ES I KRAKEKDLVGLGLH PLFSSRFEEE E LORLR L 
VDS I KNYRSRATI FEINASSK DLCSQVMRAKRQKDR XAIARFQQ 
GQOGRQEQOEGPVGPAPSRPAliQEKOPEXEEEEEAGESVEDIFS 
EWGRKRQRSGPNRGAKRRREEARQRPQEFYIPYRPK2FDSERG 
LS I SG EGG A FSQQAAG AVLDLMGDE AQNLTRGRQQLKWDR KK KR 
FVG QSGQEDKKKI KT E S GRY I SS S Y KRDLYQKW KQKQK I D * S * L 
GRRRGILTRRRPRTEEVGEARPLAQAGCIPGPHAPRKPLQAESA 
LELKTKQQ3 LKQRRRAO KAALSLQRWWPQAALCPQ 


6714 


165 


1416 


NNCQELLPPPPAPMAH Z PSGGAPAAGAAPMGPQYCVCKVELS VS 
GQN LLDRDVTS KSD P F CVLFTENNGRW I E YDR TETA1NNLN P AF 
SKK F VLDYHFEE VQ KL K FAL FDQDKS SMRLDEHDFLGO FS CSLG 
TIVSSKKITRPLLLLNDKPAGKGLITIAAQELSDNRVITLSLAG 
RRLDKKDLFGKSDPFLEFYKPGDDGKWMLVKRTEVIKYTLDPVW 
KPFTVPLVSLCDGDMEKP I QVMCYD YDNDGGHDF I GEFQTS VSQ 
MCEARDSVPLEFECINPKKQRKKKKYKNSGIIILRSCKINRDYS 
FLDYILGGC0LMFTVG1DFTASNGNPLDPSSLHYINPMGTNEYL 
S AI WAVGQI 1 QDYBSDKMFPALG FG AQLPP DWKVSHEFAI NFNP 
TNPFCSGVDGIAOAYSACLP 


6715 


32 


493 


G PAG AESGS LHCLP ATVQ ALAGAAH S PHGGQP PRRGPL 3 G SGMP 
GKPKHLGVPNGRMVLAVSDGELSSTTGPCX5QG2GRGSSLSIHSL 
PSG PS S P FPTE EQP VA S W ALS FERLLQDPLGLAY FTEFLKKE PS 
AENVTFWKACERFQQ3 PASDT 


6716 


1 


176 


GAGG P APRS FG SEE PRAALERDKMS ARAAAAKS TAMEETA I W EQ 
HTVTUIRVSLCCSK 


6717 


115 


896 


L'FAMSGFENLNTDFYOTSYSIDDQSQQSYDYGGSGGPYSKQYAG 
YDYSO0GRFVPPDMM0PQQPYTGQ1YQPTQAYTPASPQPFYGNN 
FEDEPPLLEELGINFDKI W0KTLTVLHPLKVA1X5SI MNETDLAG 
P MVFCLAFGATLLLAGK3 QFG YVYG I S AI GCLGMFCLLNLMSMT 
GVSFGCVASVLGYCLLPKILLSSFAVIFSLQGMVGI ILTAGI IG 
WCS FS AS K3 FI S ALAMEGQQLLVAYPCALLYGVFALI SVP 


6718 


29C 


599 


KQSSTVPGT3LPSLKWKNSGLCKFPETGGKMTTFKEGLTFKDVA 
VIFTEEELGLLDPVQRNLYQDVMLENFRNLLSVGHHPFKHDVFL 
LEKEKKLDIMKTATQ 


6719 


1 


691 


P'J'RPESQDREDGKCHKI^EMNPISGNLNCDPIAMSOCSSDHGCET 
DLDSBDDKIEKPNNFMKDSASQDNGLSRKISRKRVCSSDSDSSL 
QWK K S S KARTGLLR I TRR CAATAANK I KLMSDVEDVSLENVHT 
RSKNGRKKPLHLACTTAKKKLSDCEGSVHCEVPSEOYACEGKPP 
DPDSEGSTKVLSOALNGDSDSEDMLNSEHKHRKTNIHKIDAPSK 
RKSSSVTSSC- 


6720 


3 


822


HEVAEEAGGTV Y PQRGTMPGTKRFQH V I ETPE PGKWELTGYEAA 
VPITEKSNPLTQDLDKADAENIVRLLGOCDAEIFOEEGOALSTY 
QRLYS ES I LTTMVQVAGKVQEVLKEPDGGLVVLSGGGTSGRMAF 
LMSVSFNOLMKGLGQKPLYTYLIAGGPRSWASREGTEDSALHG 
I EELK KVAAG KKRV I V2 G I S VGLS APFVAGQMDCCMNNTAV FLP 
VLVGFNPVSMARHPFPPPRILRSLTVFPSLRAPHYQITSLLFSM 
SWTLISE 


6721 


3 


822 


HEVAEEAGGTVYPQRGTKPGTKRFOHVIETPEPGKWELTGYEAA 
VPITEKSNPLTODLDKADAENIVRLLGQCDAEIFQEEGOALSTY 
QRLYS ES I LTTMVQVAGKVQEVLKEPDGGLVVLSGGGTSGRMAF 
LMSVSFNQLMKGLGQKPLYTYLrAGGDRSWASREGTEDSALHG 
I EELKKVAAGKKRVI VI G I S VGLS AP FVAGQMDCCMNNTAVFLP 
VLVGFtf P VSMAK HPFPPPR J LRSLTVFPSLRAPHYOI TSLLF5M 
SWTLISE 


6722 


1 


390 


RSWSKRTWOALPMAVLFLLLFLCGTPQAADNMQAIYVALGEAVE 
LPCPS PSTLHGDEKLSWFCS PAAGS FTTLVAQVC VGR PAPDPGK 
PGRESRLRLLGN^'SLWLEGSKEEDAGRYWCAVLGOHHNYONW 


6723 


173 


659 


VCQY CTARMAD FG I SAGOFVAWWDKSSPVEALKGLVDKLOALT 
GNEGRVSVENI KQLLQS AH KESS FDI I LSGLVPGS TTLHS AE IL 
AEIARILRPGGCLFLKEPVETAVDNNSKVKTASKLCSALTLSGL 
VEVKELQKEPLTPEEVOSVREHLGHESDNL 


6724 


173 


659 


VCQYCTARMADFG1 SAGQFVAWWDKSS PVEALXGLVDKLCALT 
GNEGRVS VEK I KQLLQS AHKES S FDI 2 LSGLVPGSTTLH S AE3 L 
AEIARILRPGGCLFLKEPVETAVDNNSKVKTASKLCSALTLSGL 
VEVKELQRE PLTPEEVQSVREHLGHESDNL 


6725 


356 


722 


RRRTP F V I liATM DDDLMLALR LQE E WN LQE AER DHAQ E S LSLVD 
ASWELVDFTPDLOALFVQFNDOFFWGCLEAVEVKWSVRMTLCAG 
3CSYEGKGGMCSIRLSEPLLKLRPRKDLVEVFFV 


6726 


98 


714 


H LQKMERK 3 NR 3 EK EKE Y EG KHNS LEDT DQGKNCKSTLMTLNVG 
G YLY I TQ KQT LT KY PDTFLEG I VNGK 1 LCP FDADGHY F I DRDGL 
LFRlWLNFLRNGELLLPEGFRENQbLAOEAEFFOLKGLAEEVKS 
RWEKEQLTPRETTFLEITDNHDRSQGLK1FCNAPDFISKIKSRI 
VLVSKSRLDGFPEEFS 3 SSTJI 3 QFKYFI K 


6727 


1 


83: 


FRGMGDERFHYYGKKGTPQKYDPTFKGP3YNRGCTDI3CCVFLL 
LAIVGYVAVGIIAWTHGDPRKVIYPTDSRGEFCGOKGTKNENKP 
YLFYFWIVKCASPLVLLEFQCFTPQICVEKCPDRYLTYLNARSS 
RDFEYYKCFCVPGFKNNKGVAEVLRDGDCPAVLIPSKPLARRCF 
PAIKAYKC-VLMVGNETTYEDGHGSRKNITDLVEGAKKANGVLEA 
RQLAMR3 FEDYTVSWYWDI I SLGI AMAKSLLFI ILLRFLAG3MG 
RGM I I MG I LVLG Y 


6728 


486 


935 


FCSSWLRSLADSSLSWKMFLVGLTGGIASGKSSVIQVFQOLGCA 
V3DVDVMARKWOPGYPAHRR3VEVFGTEVLLENGDINRKVLGD 
L3FN0PDRR0LLNAITHPEIRKEMMKETFKYFLREPRTSPRGKK 
HVPSALKEADSLMRRDT 


6729 


259 


1192 


VGLTGAQS GR TASMGRDQRA VAGPALRR WJjLLGTVTVGFIACS V 
LAGVKKFDVpCGGRDCSGGCOCYPEKGGRGQPGPVGPOGYNGPP 

glogfpglogrkgdkgergapgvtgpkgdvgargvsgfpgadgi 
pghpgqggprgrpgydgcngtogdsgpogppgsegftgppgpqg 
pkgqkgepyalpkeerdryrgepgepglvgfogppgrpghvgqm 
gpvgapgrpgppgppgpkgoognrglgfygvkgekgdvgopgpn 
gi psdtlhp 1 1 aptgvtfhpdqykgekgsegepg i rg i slkgse 

GIM 


6730 


784 


1015 


NMVDY Y EV LG LOR Y AS P E D I KXA YHK VALKWHPD KN P EN K E E AE 
RKFKEVAEAYEVLSNDEKRDIYDKYGTEGLNEF 


6731 


1 


446 


GIRKRLKGAWPRVEVGCPWETRESEGVKLERPTSPLKNNDEGS 
LDIYAGLDSAVSDSASKSCVPSRNCLDLYEEILTEEGTAKXATY 
NDLO VE YG KCCLOMKELMKKF KEI QTON FSLIKENOSLKKNr SA 
LIKTARVE1NRKDEEI 


6732 


Z02 


2205 


GRWORRPPPPSPPLWCLQPGGGSDPQQLTQLRHCLSHSPQDl'PW 
AQROVCYTAATTOAAAPATRNCLPDKSGHRPTPPRSHRHHRQEN 
LGSIKPSSRSTKATSTimGDGRRAEAVREGWGVYVTPRAPIRE 
GRGRLAPQNGGSSDAPAYRTPPSRQGRREVRFSDEPPEVYGDFE 
PLVAKERS P VGKRTR LEE FRS DS AKEE VR E S AY YLR S RQRRO PR 
PQETEEMKTRRTTRLQQQHSEOPPIjQPSPVMTRRGLRDSKSSEE 
DEASSQTDLSQTISKKTVRSIOEAPAVSEDLVIRLRRPPLRYPR 
YEATSVQQKVNFSEEGETEEDDQDSSHSSVTTVKARSRDSDESG 
DKTTRSSSQYIESFW 


6733 


613 


1311 


RSCRCVGMKSRNOGGESASDGK1 SCPKPS: IGNAGEXSLSEDAK 
KKKKSNRKEDDVMASGTVKRHLKTSGECERKTKKSLELSKEDLI 
QLLSIMEGELOAREDVIHMLKTEKTKPEVLEAHYGSAEPEKVLR 
VLHRDA I LAO E K S 3 G E DVY E KP 1 S ELDRLE EKQKET Y RRMLEQL 
LIjAEKCHRRTVYELENEKJiKKTDYMNKSDDFTTTLLEOERERLKK 
LLEQEKAYQARKE 


6734 


189 


551 


SAAMFPVFSGCFOELQEKNKSLELVSFEEVAVHFTWEEWQDLDD 
AQRTLYRD VMLETYS SLVSLGHCI TKPEMI FKLEOGAEPWI VEE 
TLNLRLSGGSKKCVFSGICKRSLVSLOHVHbV 


6735 


28C 


558 


KisKRAbV HvJ-lSNPr LKyVr MUJKl tRl KKKfr bPGiyKr LLHKKA 
CASLNAGLDLRLAVOLPPGEDLNDVA/AA'HWDFFNRVl^LIYGTI 
XDGCT 


6736 


1 Si- 


808 


MNYELNFKREMPNJKSLGLTKLNFLLKKLSSVLPLlTDYVyFEN 
£SSNPYLIRRIEELNKTASGNVEAKV\'CFYRRRDISNTLIMLAD 
KKAKEIEEESETTVEADLTDKOKHQLKHRELFLSROYESI.PATH 
IRGKCSVALLNETESVLSYLDKEDTFyVSLVYDPSLKTLLADKG 
EIRVGPRYQADIPEMLLEGTFFCVFAVj. 


6737 


ISO 


1209 


PVIMPbHFSPGDIVKPSCCVSSSPKLK.nNAHSRLESYRPDTDLS 

redtgcnlohisdreniddlnmefnphdhprastiflsksqtdv 
rekrkslfinhhppgoiarkysscstiflddstvsopnlkytik 
c valai y yh i knrd pdgrmlld 1 fdenlh pls kse v p pdydkhn 

PEOKOIYRFVRTliFSAAQLTAECAIVTl-VYLERLLTYAEIDI CP 
AN W KRIVLGA 1 LLAS KVWDDQA VWNVD Y CO I UCDI TVEDMNELE 
ROFLELLOFNINVPSSVYAKYYFDLRSLAEANNLSFPLEPLSRE 
RAHKLEAISRbCEDKYKDLRRSARKRSASADNLTbPRWSPAIIS 


6738 


14£ 


653 


CACAEQPAPJvEVGAATAIjPWiWASGE^-PSGSLAVPLAVLVLIjL 
WG AP WTHGR R SNVR VI TDEN WRELLEG D WMI E FY A P W C P ACQNL ■ 
OPEWESFAEWGEDLEVNIAKVDVTEQPGLSGRFIITALPTIYHC 
KDGEFRRYQGPRTKKDFINF3 SDKEWK. C 3 EPVSSWF 


6739 


3 


631 


S W P DMAEE EVA KLE KH LM LliRQEYVK LO KKLAETE KRC ALLAAQ 
ANKESSSESFISRLLAIVADLYEQEQY^DLKI KVGDRK: SAHKF 
VLAAR5DSWSliANLSSTKELDLSDANPFVTMTMbRWIYTDELEF 
REDDVFLTELMKIiANRFQLQLLPJSRCEKGVKSLVNVRNCIRFYQ 
TAEELNASTLMNYCAE 1 1 AS; 1WVSEVEGVNKAL 


6740 


3 


631 


SWPDMAEEEVAKbEKHLMLLRQEYVKLC KKJjAETEKRCALLAAQ 
ANKESSSESFI SRLLA3 VADLYEQEQ Y E DLK I KVGDRH I SAHKF 
VLAARSDSWSLANbSSTKEbDLSDANPFVTKTMLRWIYTDELEF 
REDDVFLTELMKLANRFQLObLRERCE KSVMSLVNVRNCI RFYQ 
TAEELNASTLMNYCAEI IASHWVSEVEGVNKAL 


6741 


141 


960 


PLTLPFSSRARAGHTMNTSPGTVGSDPVJLATAGYDHTVRFWQA 
HSG I CTRTVQHQDSQVNALEVTPDRSM} AAAVQP VSLGYQH IRM 
YDLNSNNPNP1 1 S YDG\WKNI ASVGFHEDGRWMYTGGEDCTARI 
WDLRS RNLQCQR I FOVNAP I NCVCLHPK OAELI VGDQSGA I H I W 
DLKTDK N EQL 1 PE PEVS ITS AH I DP DA .V- V MAA VN S TLV P FS CLL 
PLAIGlLQEGEFESLAPRGLLFLACQGKCYVWNLTGGIGDEVrQ 
LIFKTKIP 


6742 


141 


960 


PLTLPFSSRARAGHTMNTSPGTVGSDPVj LATAGYDHTVRFWOA 
HSG 3 C7RTVQKQDSQVNALEVTPDRSM 3 AAAVQP VSLGYQH IRM 
YDLNSNN PNPI ISYDGVNKKI ASVGFH EDGR WMYTGGEDCTAR I 

WDLiRS RN Uj COR 1 r Q VNAF I N CVCLn FN wAbLii Via JJUbbA 1H1N 
DLKTDHNE0LIPEPEVSITSAHIDPDASY14AAVNSTLVPFSCLL 
PL AIG I LQEGEFESLAPJRGLLFLACQGN C YVWKLTGGI GDEVTQ 
LIPKTKIP 


6743 


1 


412 


MHSTODKSLHLEGDPNPSAAPTSTCAPR KMPKRI S I SKQLASVK 
ALRKCS DLEKAI ATTALI FRN S SDS DG KL>E KAI AKDLLQTQFRN 
FAEGQETKP KYRE I LS ELDEHTENKLD F t D FMI LLLS I T VMSDL 
LQNIR ' 


6744 


95 


1343 


RTP ARNR CAGCE VLS R FSS PN KAS S FALC'S AGGGLPA VRALRRD 
R0KVSTVGYGMDEVEQDQHBARLKELFDSFDTTGTGSLGOEELT 
DLCHMLSLE2VAPVLQQTLL0DNLLGRV>: FDQFKEALI LIliSRT 
LSNEEHFQEPDCSLEAQPKYVPGGKRYGKRSLPEFQESVEEFPE 
VTVIEPLDEEARPSHIPAGDCSEHWKTOKSEEYEAEGQLRFWNP 
DDLNASQSGS S PPQD W I EE KLO E VCEDLG I TRDGH LNR KK1-V S I 
CEQYGijONVDGEhijE&VrhJYLDPtA^^ Xobr KNCjKJbljI 
P S AST P Y RQLKRH LS MQS FDE SGRRTTTS SAMTST I G FR VFS CL 
DDGMGHASVERILDTWQEEGIENS0E1LKALDFGLDGNINLTEL 
TLiALbNE LLV1 KNS I HQAL I 


6745 


a 


588 


TFRDOGWAORRKWLLGCASWESWEAAIAAGPGbPSSTAROONNP 
AAGTEC r AAVWARGTAMGSVLSrDSGXS APASATARALERRRDP 
ELPVTSFDCAVCLEVLHQPVRTRCGKVFCRSC1ATSLKNNKWTC 
PYCRAYLPSEGVPATDVAKRMKSEYKNCAECDTLVCLSEMRAHI 
RTCQKYIDKYGPLQELSETA 


6746 


110 


492 


GATGAMAESAPARHRRKRRSTPLTSSTLPSQATEKSSYFCTTEI 
S LWTWAAI QAVEK KMESCAARLQSLEGRTGTAEXKLADCEKMA 
VEFGNOLEGXWAVLGTLLQEYGLLQRRLiENVENLLRNRN 


6747 


247


481 


EAVTFKDVAWFTEEELGLLDLiAQRKLYRDVMLENFRNLLEVGH 
QPFHRDTFHFLREEKFWMMDIATQREGNSVYAGVC 


6748


201 


665 


MTT FKEAVTFXDVAW FTEEELGLLDPAQRKLYRDVMLHNFRNL 
LS VGNQP FHQDTFK FLG KE KFWKMKTTS QREGNSGGXI Q I E MET 
VPEAGFHEEWSCQQ I WEQI ASDLTRSQNS I RNSSQFFKEGDVPC 
Q I EARLS I S X VQQX P YRCNECKQ 


6749 


95 


719 


RREV KGGDGVCPRARGS PQSQQFP SCAGGGEGLQOSGEALDGAM 
SAGGPCPAAAGGGPGGASCSVGAPGGVSMFRWLEVLEKEFDKAF 
VDVDLLLGEI DPD0API TYEGRQXMTSLSSCFAQLCHKAOSVSQ 
INHKLEAQLVDLKSELTEI'QAEKVVLEKEVHDOLLQLHSIQLQL 
HAKTGQSAOSGTI KAKLSGPSVEELERELKAN ■ 


6750 


3 


42£ 


SCESRRPGAKWVWASGALPRnTTGLGSEQPSGDVAQSNRATMGT 
TAPGPlHLLELCDCKLMEFLCNMDNKDLiWILEEIQEEAERMFTR 
EFSKEPELMPKTPSOKNRRKKRR I SYVQDENRJDPIRRRLSRRKS 
RSSQLSSRR 


6751 


152 


1417 


PTKATEMAGASVKVAVRVRPFNSREMSRDSKCIIQMSGSTTTIV 
N PKQPKETP KS FS FD YS YW SHTS PED 1 NY AS QKQV YRD I GB EML 
0KAFEG YNVCI FAYGOTGAGKS YTMMGKOEXDQQGI 1 PQIATEDL 
F SR I NDTTNDNMS Y S VE VS YME I Y CE R VRDLLNP KNKGNLR VRE 
HPLLGPWEDLSXLAVTSYND1QDLMDSGNKARTVAATNMNETS 
SRSHAVFNI 1 FTQKRKDAETNITTEKVSK2 SLVDLAGS ERADST 
GAXGTRLKEGANINXSLTTIX3KV3SAIAEKDSGPNKNKKKKKTD 
FIPYRDSVLTWLLRENLGGNSRTAMVAALSPADINYDETLSTLR 
YADRAKOI RCNAVI NEDPNNXLI RELXDEVTRLRDLLYAOGLGD 
I TDWTNALVGMS PS SSLSALSSRNV 


6752 


24 


1834 


RNCVPPLGCYRSRVKFHSDIXMQYSKHCEHLLERLNKOREAGFL 
(ZDC7TJVIGEF0FKAHRNVLASFSEYFGAIYRSTSENNVF1»D0SQ 
VKADG FQKLLEFI YTGTLNLDSWNVKE IHQAADYLKVEEWTKC 
Kl KNEDFAFI ANPS STE I SS ITGNI E LNQQTCLLTLRDYNNREK 
SEVSTDuIOAKPKOGALAKKSSOTKKKXXAFKSPKTGQNKTVQy 
PSDILENASVELFLOANKLPTPWE0VAQINDNSELELTSWEN 
TFPAQD I VHTVTV XRXRGKS QPNCALXEH SMSNI ASV KS P Y E AE 
NSGEEL DQR YS KAK PMCNTCGKVFSEAS S LRRHMR IHKG VK P YV 
CHLCGKAFTOCNQLXTHVRTHTGEKPYKCEL.CDKGFAQKCOLVF 
H QRMHK fiFFKF YKC nVPNT/lF ATS SN LK I H AR KHSGEKP YVCDR 
CGQR FAOASTLTYKVRRHTGE KP YVCDT CGKA FA VS S S L I THS R 
XHTGEKPF I CELCGNSYTDI Kl^LKKHXTKVKSGADKTLDSSAED 
HTLSEODSIOKSPLSETMDVKPSDMTLPLALPLGTEDHHMLLPV 
TDTOSPTSDTLLRSTVNGYSEPQLIFLQQLY 


6753 


2 


1305 


VPSLPYPPOKWARTEFTTSSDSETANGIAKPDPVMPGGEEXAS 
PFGIKLRRTNYSLRFNCDQQAEQKKKXRHSSTGDSADAGFPAAG 
SARGEKEMEGVAIiKHGPSLPQERKQAPSTRRDSAEPSSSRSVPV 
AHPGPPPASSOTPAPEHDKAANKMPLAOKPALAPKPTSOTPPAS 
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BNSOOCID: <WO 0153312A1_L: 



WO 01/53312 



PCT/USO0/34263 



SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








PLSKLSRPYLVELLSRRAGRPE>PEPSEPSKEDQESSDRRPPSPP 
GPEERKGQKRDEEEEATERKPASPFLPATGOEKPSQTPEAGRKE 
KPMbOSRHSLDGSKLTEK^TAQPLWITLALOKOKGFREQOATR 
EERKCAR EAKQAEKLS KENVSVS VQPGSS SVS RAGS LH KSTALP 
EEXRPE7AV SRLERREQLKXANTLPTSVTV:: I SYSSPAAPLVKE 
VSKRFSSPDDAPVSSEPAWLALAKRKAKAWEDCPLIIK 


6754 




413 


P VRRRRRRLGGPEVNTMSSLHKSRI ADF0UV1,KEPS 1 AliEKLRE 
LSFS63 PCEGGLRCLCWKXLLNYLPLERASKTS ILAKQRELYAQ 
FLREM1 IQPG1 AKANMGVSREDVTFEDHPLKPNPDSRVfNTYFKD 
NEVLL 


6755 


296 


1343 


?GLQLOVALEADWFLDMPGGRRGPSRCX?LSRSALPSLQTLVGGG 
CGNGTGLRNRNGSAIGLPVPP1TALITPGPVRHCQIFDLPVDGS 
LLFEFLFFIVLLVALFIQVINlYXTWQWypyKHPASCTSLNFHL 
3 DYHLAAFI TVMLARRLVWAL1SEATKAGAASMIH YfWLI SARL 
VLLTLCG W VLCWTLVNLFR SHS VLNLLFLG Y P FGV Y VPLC CFHQ 
DS RAHLLLTDYN YVVQHEAVEESAS TVGGLAK S KDFLSLLLESL 
KEQFNMATPI FTHSCFLS PDLIRNEVECLKADFNHRIKEVLFNS 
IFSAYYVAFLPLCFVKVSGVLTFMCFLDLCVICYINWVFLV 


6756 


180 


754 


IERALGSLPLSIPVSWGSIiRTLKYQQQPLRPKVLLCQTRVOCHD 
LRS LQPQ PPGL KQS FCLR VLGLQTG ATTPG LR DLTC KEL 1 1 LTE 
REAQXRKKRKEKESGMALTCGPLTFRDVAIEFSQEEWKSLDPVQ 
KALYWDVMLENYRNTLVFLGKDNFALEVK3CFRVFLYFLCCLSWE 
PFHYLTETEALLTHK 


6757 


2 


459 


KSRVJBAPEAHSRESOGSDAMRKHLSWWW1ATVCMLLFSHLSAVQ 
TRG 3 KHR 1 KWNR KALPSTAQI TEAQVAENRPGAF 1 KQGRKLDID 
FGAEGNRYYEANyWQFPDGIHYNGCSEANV^KEAFVTGCINATQ 
AANQGE FCKPDKKLHQQVLW 


6758 


j- 


iooe 


A SG PEli PGRRFR DRAP WLP ARLLRG V1AVW VS LSALG P G S FCRR 
RVPSLAQbGHSEAAPSPDDVRWSRVPDRCPEERDRAWPPPPPPS 
L PPS FR RN MANN £ P ALTGNS 0PQHOAAAAAACOQQ0 CGGGGATK 
PAVSGKQGNVLPLWGNEKTMNLNPK1 WHILES PYFKVQLYELK 
TYHEWDEIYFKVTHVEPWEKGSRKTAGQTGMCGGVRGVGTGGI 
V S TAFCLLY KLFT L KLTRKQVMGLI TKTDS PY 3 RALGFMY3RYT 
OPPTDLWDV7FESFLDDEEDLDVKAGGGCVMT3GEMLRSFLTKLE 
W FSTLFPR 1 PVPVQKNIDQQ1 KTRPRKI 


6759 


I 


513 


R KHNFHS LDGTSTRAFHPQTGLPLLS S P VPOR KTOSGCFDLDS S 
LLHLKSFSSRSFRPCLNIEDDPDIHEKPFLSSSAPPITSLSLLG 
NFEESVLN YR FDPLG I VDG FTAEVGASGAFCPTHLTLPVEVS FY 
S V SDDNAP S PYMGV I TLESLGKRG YRV P P SGT I QW CVL 


6760 


239 


606 


VL-S KKKGL S AEEKRTRMMEI FSETKDV FQLKDLEKI APKEKG I T 
W -iS VKEV LOS LVDDG MVDCER1 GT SNY YW AF ? S KALHARKHKLE 
VLESQLSEGSQKHASLQKSIEKAKIGRCETEERT 


6761 


2S> 


1733 


ER TLRGLRE V AAP S DVADAAVS RRGR CCCCLH CTQTQV AQDCP S 
SSSSVQRCELSLFQSLHTMTSKKLVNSVAGCADDALAGIiVACNP 
NLCLLOGH R VALRSDLDS LKGRVALLSGGGSGH EP AHAGF I GKG 
ML7G VI AGAVFTS PA VGS I LAA I RAVAQAGTVG TLL 3 VKNYTG D 
RLE FGLAREQARAEG 3 PVEMVVI6DDSAFTVLKKAGRRGLCGTV 
L I K KVAGALAEAGVG LEE I AKQVNWT KAMGTLGVSLSSCSVPG 
S K PTFEL£7\DEVELGliGI HGEAGVRR I KMATADE 1 VKLMLDHMT 
NTTNASHVP VQPGSS WMMVNNLGGLS FLELG 3 I ADATVRSLEG 
RGV K.XARALVGTFMSALEMPG3 SLTLLLVDEPLLKLI DAETTAA 
AWPNVAAVS 3 TGRKRSRVAPAEPOEAPDSTAAGGSASKRMALVL 
ERVCSTLLGLEEHLNALDRJ^GDGDCGTTHSRAARAIOEMLKEG 
PFFASPAOLLSKLSVLLLEKMGGSSGALYGLFLTAAAOPLKAKT 
SL t AWS AAMDAGLEAMOKYGKAAPGDRTMLDS LW AAGQEL 


6762 


2 


623 


A57ISWRLCVAGAEARRPVPVAGERAGCGAMWFMYLLSWLSLFI 
0VAFITLAVAAGLYYLAELIEEVTVATSR1 1 KYMIWFSTAVLIG 
LYVFERFPTSMIGVGLFTNLVYFGLLOTFPFIMLTSPNFILSCG 
L\nrVNHYLAFOFFABEYYPFSEVLAYFTFCLWl IPFAFFVSLSA 
G E NVL PSTMQPGDD WSN Y FTKG KRG K 


6763 




760 


SG FD FPGRR FRGCCCVRP PAGAGMELGGKWDMNS APRLVS ETAE 
RKQEQKTGTEAEAADSGAVGARRFLLCLYLGGFLDLFGVSMWP 
LLS LHVKS LGAS PTVAG I VGSS YG I LOLFSSTLVGCWS DWGRR 
£ S LLAC I LLS ALGYLLU3AATNYFLFVLAK VPAG I FKHTLS ISK 
ALLSDWFEKERPLVIGHFNTAJSGVGFTLGPWGGYI.TELEDGF 
YLTAFICFTjVFILNAGLWJFFPRREAKPGSTE 


6764 


80 


43E 


LKKMDTWLSVRNLFEQLVRRVEI LSEGNEVQF IQLAKDFEDFR 
KKWQRTDHE LG KY KDLLMKAET ER S AljD VKL KI1ARNQVD VE I KR 
RQ R AEADCEKLERQ3 QLI REMLNCDTSGS 10 


6765 


3 


550 


AK Y S RVDH F CRRR CRAVARAFR FLLQ F PSG P S RH FLAAC V ARWL 
RGSVliVSEALSGSAWDGIVTEVAVGVKRGSDELLSGSVLSSPNS 
NMSSMWTANGNDSKKFKGEDKNDGAPSRVLHIRKLPGEVTETB i 
V1ALGLPFGKVTNILMLKGKN0AFLELATEEAAITNGNYYSAVT 
PKLRNQ 


6766 




1287 


EC-GSFKASLTWLWPLGEMKLHCEVEVI SRHLPALGLRNRGKGVR 
AVLSbCQOTSRSOPPVRAFLLISTLKDKRGTRYELRENlEQFFT 
KFVDEGKATVRLKEPPVDICLSKANSSSLKGFLSAMRliAHRGCN 
Vr>TPVSTLTPVKTSEFENFKTKN\ f ITSKKDYPLSKNFPYSLEHL 
QTSYCGLVRVDMRMLCLKSLRKLDLSHNHIKKLPATIGDLIHLQ 
ELNLNDNHLBS FS VALCHSTLQKS LWSLDLS KNKI KALPVQFCQ 
LCELKNLKLDDNEL1QFPCKIG01»INLRFLSAARNKLPFLPSEF 
RNLSLEYLDLFGNTFEOPKVLPVI KL0APLTLLESSART2 LHNR 
IPYGSHIIPFHLCQDLDTAKICVCGRFCLNSF1QGTTTMKLHSV 
AHTWLVDNLGGTEAPI I SY FCSLGCYVNSSD1 


6767 


336 


SIS 


APMI CLCSSDLOFRYKEAFLRDRGLOI GYCSVDDDPRMKHFLNV 
GRLOSDNEYKKDFAKSRSQFHSSTDOPGLLOAKRSQQLASDVHY 

PPGSYKVEMARRAAilJu^ARGLGLQGAYRGAEAVEAGDHOSGEV 
N PDATE I LHVKKKKALLL 


6768 


2 


363 


PGSTISCYLLSEGSLPLCMQVACGEEKHRAPTMKTLRARFKKTE 
LRLSPTDLGSCPPCGPCP1PKPAARGRRQSQDWGKSDERLLOAV 
ENNDAPRVAAL3ARKGLVPTKLDPEGKSAFHL 


6769 


284 


396 


KSTPDFSTAEN^OEIANEVSCIjKAMLTLMLOAMGQAD 


6770 


t 


397 


0RNYQVIWSSTMAKLHDYYKDEWKKLMTEFNYNSVMOVPRVEK 
ITLNMGVGEAIADKKLLDNAAADLAAI SGOKPLITKARKSVAGF 
Kl RQGY P I GCKVTLRGERMWEFFERL I T I AVPR I RDFRGLSAKS 


6771 


3 


376 


APAGTLAMTGKSVKDVDRYQAVLiANLLLEEDNKFCADCQSKGPR 

WASWN IGVFI CI RCAG IHRNLGVH 2 SRVXS VNLDOWTQEO I QCM 
OEMGNGKANRLYEAYLPETFRRPOIDPYLFWSNLEG 


6772 


1 


1400 


AAAFLOGMTVNGFINTVlTSL\ERRYDLHSYQSGIiZASSYDIAA 
CLCLTFVSyFGGSG\HKPRWLGWGR\VLMGTGSLVFALPHFTAG 

P* + GWKLDAGVRTCPANPR\ PVCAG \HTSGLSRYQLVFMLGQFL 
HGVGATPLYTLG VTYLDENVKSSCSP 1 YIAI FYTAAILGPAAGY 
LI GGALLNI YTEMGRRTELTTESPLWVGAWWVGFLGSGAAAFFT 
AVPILGYPRQLPGSQRYAVMRAAEMHQLKDSSRGEASNPDFGKT 
IRDLPLSIWLLLKNPTFILLCLAGATEATLITGMSTFSPKFLES 
OFSLSASEAATLFGYLWPAGGGGTFLGGFFVNIARLRGSAVIK 
FCLFCTVVSLLGILVFSLHCPSVPMAGVTASYGGSLLPEGHLNL 
TAPCNAACS CQPEHYSPVCGSDGLMY FSLCHAGCPAATETNVDG 
OKVYRDCSCI PQNLSSGFGHATAGKCTST 


6773 


2 


630 


PWEAPKEHKYKAEEHTWLTVTGEPCHFPFOYHRQLYKKCTKKG 
KPGPQPWCATTPNFDODQRWGYCLEPKKVKDHCSKHSPCQKGGT 
CVNMPSGPHCLCPQHLTGNHCQKEKCFEPQLLRFFHKNEIWYR? 
E0AAVARCOCKGPDAHCQRLAS0ACRTNPCLHGGRCLEVEGHRL 
CHCPVGYTG?FCDVGE*GSGASRRPAPRWDGLAR 


6774 


146 


385 


LTELSDQ0YFLFFILSS/WVPTFLSMDVDGRVIKADSFSKI1SS 
GLRIGFLTGPKPLIERVILK10VSTLHPSTFNQLMIS0 


6775 


104 


614 


TCPSQLRVLTARGGRRAPSPQLKTLVLALIESKWRSHRILRKNS 
GRPETMENLPALYTIFOGEVAMVTDYGAF1KIPGCRK0GLVHRT 
HMSSCRVDKPSEIVDVGDKVWVXL1GREMKNDRIKVSLSKKWN 
OGTGKDLDPNNV\SLSKKRGGGDPSRlTLGRRSPLRbS 


6776 


3 


noe 


KERHERHEGALSODAbLRISIPLDSNMRPEKCRRFVHPQWOLLH 
LNGTFPKTSDADMEPCVDGKVYDRISFSSTIVTEWDLVCDSCSL 
TSVAKFV FMAGMMVGG I LGGH LS DR FGRR FVLR WC YLQVA I VGT 
CAALAPTFLI YCSLRFLSG I AAMSLITNT J ML I AEWATH R PQAM 
G1TLGMCPSGIAFMTLAGLAFAI RDWHIL0LWSVPYFVI FLTS 
SWLLESARWLI INKKPEEGLKELRKAAHRSGf4KNARDTLTbEIL 
KSTMXKELEAAQKKKPFLGERLHMPNICKRISLLPFTKFAMFKA 
YFGLNLHG / LKHLGNNVFLLQTLFGAV/ TP PGOUVLHLGH WGSG 
RVS SRGRVNCLGLFVLQVW 


6777 


779 


63 


CFFHGPAWRDCSVRATFAKKOGOSGIISCJAFSPAQPLYACGSy 
GRS LGLYAWDDGS PIiALLGGHQGGI THLCFH PDGNRFFS G AR KD 
AELLCWDLROSGYPLWSLGREVTTNORIVFDLDPTGQFLVSGST 
SGAVSVWDTDGPGNDGKPEPVLSFLPQKDCTNGVSLHPSLPLLG 
H CLPVS VCFL S PTESGGRRRG AGPS LGS PRRHVH LE CRLQLW W C 
GGGARLQ1IP* * SPRARKGR 


6778 


311 


60S 


I QS X TDESRGS 1 RRKNPANTRLELNVPX EETAGDS E/ER SPEEE 
VOADPRIRSASPKCPTSSPFPKGRSPEGEGET\DPEKVHFHPGP 
XDKSVAEKN \XGP\SPVSSEGJ KDFF SMKPE WENLNQSNV RRMK 
T\AVRLNEVIVKKSRDAKLVLLNMPGPPRKRKGDENY 


6779 


2 


53b 


RALRRQPRLLAANGI EPESMA1 SEP I KGSRKPCWKEEUibKKP 
MAKCAWKGPREPPODARAEAES PGGASESDQDGGHESPPK KKAV 
AVJVSAKNPAPMRKKKK\ r SLGPVSYVLVBSEDGRKKPVMPKKGPG 
SRREASDOKAPRGQCPAEATASTSRGPKAKPEGSPRRATNESRK 
V 


6780 


3 


403 


HE VN DNKPE INI NLMSPGKEE I £ Y 1 PEGDF 1 DT FVALVR VQDKP 
SGLNGEIVCKLHGHGHFKLQKTYENWYLILTNATLDREKRSEYS 
LTVIAEDRGTPSLSTVKKFTVOJNDINEMPPHFORSRYBrvISE 
K 


6781 


1 


1269 


AFTRPVFPTLODLSSSKEPSNSLNLPHSNELCSSLVHPELSEVS 
SNVAPSIPPVMSRPVSSSS1STPLPPNQ1TVFVTSNPITTSANT 
S AALPTH LQSALMS TWTMPNAG S KVMVSEG QS AAQSNAR P QFI 
TPVFINSSSIIQVMKPSQPSTIPAAPLTTNSGLMPPSVAWGPL 
HIPONIKFSSAPVPPNALSSSPAPNIQTGRPLVLSSRATPVQLP 
SPPCTSSPWPSHPPVQQVKEIiKPDEASPQVNTSADQNTLPSSQ 
STTM VS P LLTNS PG SS GNRRS PVS S SKGKGKVDK I GQ ILLT KAC 
KKVTG S LEKG EEO YGADGETEGCGLDTTAPG LMGTEQLST E L£)S 

ft J e I trtriW 1 libftll J. o2>ir vUPOiAt/ioJrolAWauAJjr i ov-Kia i V 1 1 

I>V P SEL I S AVPTT KSNKGG I ASE SLAG 


6782 


3 


1327 


RKPTVIRIPAKPGKC1/KEDPQSPPPLPAEKPIGNTFSTVSGKLS 
NVERTRNLESNKPGQTGGFVRVPPRLPPRPVKGKT1 PTQOP PTK 
VPPERPPPPKLSATRRSNKKLPFNRSSSDMDLQKKOSNLATGLS 
XAKSQVFKNODPVLPPRPKPGHPLYSKYMLSVPHGIANEDIVSQ 
NPGELSCKRGDVLVMLKOTENNYLECQKGEDTGR\rHLSOMKLlT 
PLDEHLRSRPNPFSPPKAPSHAOKPVBSGAPHAWLHDFPAEQV 
DDLNLTSGEIVVLXjEKIDTDWYRGNCRNQIGIFPANYVKVIIDI 
PEGGNGKRECV5SHCVKGSRCVARFEYIGEQKDELSFSEGEIII 

LKEYVNEEWA:-. G E VRGRTG I F PLNFVE PVEDYPTSGANVLS TKV 

PLKTKKEDSG.^'?SQVNSLPAEWCEALKSFTAETSDDLSFKRGDR 
I 


6783 


3 


1750 


SYHKHHAQQSa./vASPNLTASQKTVTTTSMITTKTLPLVIjKAATA 
TMPASWGQRF 7 I AMVTA INSQKAVLSTDVQNTP VNLQTS S KVT 
GPGAEAVQIVA}L\ T TVTLQVQATPPQPIKVPQFIPPPRLTPRPNF 
LP0VRPKPVA0JJWIPIAPAPPPMLAAPQLIQRPVMLTKFTP7TL 
PTSQNSlHPVF\ r v7iaGQTATIAKTFPMAQLTSIVIAT?GTRLAGP 
QTVOLSKPSLEKOTVKSKTETDEKQTESRTITPPAAPKPKREEN 
PQKLAFMVSLGLVTHDHLEEIQSKRQERKRRTTANPVYSGAVFE 
?ERKKSAVTYL?:STMHPGTRKRGRPPKYNAVLGFGALTPTSPQS 
SHPDSPENEKTETTFTFPAPVQPVSIiPSPTSTDGDIHEDFCSVC 
RKSG OLLiMGDT C S R VYHLDCLDP PL KT I PKGMW I C P R CQDQM LK 
KSEAIPWPGTLAIVHSYIAYKAAKEEEKQKLLKWSSDLKQEREQ 
LE0KVKQLSNS3SKCMEMKNTILARQKEMHSSLEKVK0LIRLIH 
GIDLSKPVDSEA7VGAISNGPDCTPPANAATSTPAPSPSS0SCT 
ANCNQGEETK 


6784 


3 



1750 


S Y HHHHAQQS A A A S PNLTAS Q KT VTTTS MI TTKT LPLVL KAATA 
TMPASWG0R1 : IAMVTAINSQKAVLSTDVQNTPVNLOTSSKVT 
GPG7AEAVQIV7iyj\ T TVTLOVOATPPQPI KVPQFI PPPRLTPRPNF 
liPQVRPKPVACKN I PI APAPPPMLAAPOLIQRPVMLTKFTPTTL 
PTSONSIHPVRX^GQTATIAKTFPMAQLTSIVIATPGTRLAGP 
0TVCI»S K P SLE > .QT V KSHTE TDE KQTES RT I TPPAAP X P KRE EN 
PQK1AF^S1X1\^THDHLEEIQSKRQERKRRTT7\NPVYSGAVPE 
PERKKSAVTYIKSTMHPGTRKRGRPPKYNAVIiGFGALTPTSPQS 
S K P DS P ENEKTE T T FTFPAP VQP VSLPS P TS TDGD I H ED FCS VC 
RXSGQLLMCDTCS RVYHLDCLDPPLKT I PKGMW1 CPRCQDOMLK 
KEEAI PWPGTLA3 VKSYIAYKAAKEEEKQKLLKWSSDLKQEREQ 
LEOKVKQLSNS j S KCMEMKNTI LARQKEMHSSLEKVKQLI RLIH 
GIDLSKPVDSFJATVGAISNGPDCTPPANAATSTPAPSPSSOSCT 
ANCNQGEETK 


6785 


1 


528 


LGNTVLHYCSMY S KPECLKl>LLRS KPTVDI VNQAGETALDI AKR 
LKATOCEDLLSOAKSGKFKPHVHVEYEt^NLRQEEIDESDDDLDD 
KPSPVKKERSPKPOSFCHSSSISPQDKLALPGFSTPRDKQRLSY 
GAFTNOI FVSTM 1 TDSPTSPTTEAPPIiPPRNAGKGPTGPPITPHR 


6786 


182C 


1397 


RSPKVLVJaAPT^- ^iiANHVSRDFKDI \TRKLTVARFYGGT£YQSQ 
I NH I RNG I DI LVG7PGRI KDHLQSGRLDLSKLRHVVLDEVDOMIj 
DLGFAEQVED1 : H E S YKTDS EDNPQTLLFS ATCPQWV YTVA\ KK 
YMKSRYEQVDL! «G KMTQKAATTV2HLA IQCHWSQRFAV IGDVLQ 
VYSGS EGRAI I FCET KKNVTEMAMNPH I KQNAQCLHGD 1 AQS QR 
EITLKGFREGSF KVIjVATNVAARGLDI PEVDLVIQSSPPQDVES 
Y IHRSGRTGRAGK TG I CI CF YQPRERGQLRYVEQKAGITFKRVG 
VPSTMDLVKSK5. KDAI RSLASVSYAAVDFFRPSAQRLIEEKGAV 
DAIAAALAH I S G A S S r EPRSL 1 TS DKG FVTMTLESLE E I QD VSC 
AWKELNRKLSSKAVSQITRMCLLKGNMGVCFDVPTTESERIiQAE 
WHDSDWILSVPAKLPEIEEYYDGNTSSNSRQRSGWSSGRSGRSG 

FD*VFYHLVDFLSDFLVDSVYLTGRQIDHLTGIjTGLIDHLTSHS 
SVWN 


6787 


2646 


2270 


PSSFPKNVPLEHLEEPPK* KRSGLrGSLTPKSQIQNGP* PQTFFF 
FELGSPSGVISA-CNLRLLGSSDSPAPASRVAGIIGTCHHAWLI 
LVFLVEKGFHHVGOAGLKLLTL\VIHPPWPPKVLGLOT 


6788 


16 


S36 


GGTVDLR\DMLAVSVLAAVRGGR/ATVRRVRESNVLHEKSKGKT 
REG AE DKMTSG D VLSNRKM F YLLKTAFP S VQ 1 NTEEHVD \ ELDQ 
EVILVJGS*DS* GYPKGK* LLPXEVFSR / RVLLSGLT?LDATQE\ 
FTEDLS K\YVTTMVCVAVKGKPMLGV1 HKF FSEY7AWAMVDGGS 
NVKARS S YNE KTPRI WS RSHSGWVKQVALQTFGNQTTI 1 PAGG 
AGYKVLALLDVPDKSOEKADLYIHVTYI KXWDICAGNAILKALG 

GHK 


6789 


2 


678 


GNG 2 NVXjKI APES A I KFMA YEQ I KRLV W * * PGDS * G F/ YERLVA 
GSliAGAI AOSS I Y PMEVLKTRMALRKTGQY SGMLDCAR R ILARE 
G VAAFYKG Y V PNMLG 1 1 P YAG I DLAVYETLXNAWLQH YAVNSAD 
PGVFVLlACGTMSSTCGObASYPlALVRTRMOAQASIEGAPEVT 
MS S LFKH I LR TEG A FGL YRGLAPNFMJCV I PAVS I S YWYENLKI 
TLGVQSR 


6790 




4068 


A P P AGR R RMQAA PRAG CG AALLLW I VS S C L CRAWTAPS TSQKCD 
EPLVSGLPHVAFSSSSSI SGSYSPGYAKI NKRGGAGGWSPSDSD 
HYQWLQVDFGNRKO I SA1AT0GRYSSSDWVTQYRMI.YSDTGRNW 
K PYHQDGNI WAFPGN .1 NS DG WRHELQHP 1 1 AR YVR 1 VPLDWNG 
EG R I G LR I E VYGCS Y WADVI N FDGHWL P Y R FRN KKMXTLKDV I 
ALNFKTSESEGVILHGEGOOGDYITLELKKAKLVLSLNLGSNQL 
G P 1 YGHTS VMTGSLLDDHKWH SWI EUQGR S I NLTLDR SMQHFR 
TNGEFDYLDLDYEITFGGIPFSGKPSSSSRKNFKGCMESINYNG 
VNI TDLARRKKLEPSNVGNLS FSCVEPYTVPVFFNATS YLEVPG 
RUsfODLFS VSFQFR TWNPMGLLVFSHFADHLGIWE I DLTESKVG 
VHIN1 T0TKMS0I D I S SGSGLNDGQWHEVR FLAKEN FAI LTIDG 
DEASAVRTNSPLQVKTGEKYFFGGFLNQMK^SSHSVLQPSFQGC 
MQL1 0VDDQLVN1.Y E VAQRK PGSFANVS I DMCA! I DRCVPNHCE 
HGGKCSQTWDS FKCTCDETGY SGATCHN S I YEPSCE AY KHLGQT 
S NY YW I DPDGSGPLGPLKVYCNMTEDKVWTIVSHDLOMQTPWG 
YNPEKYSVTOLVYSASMDOISAITDSAEYCEQYVSYFCKMSRLL 
NTPDGSPYTWWVGKANEKHYYWGGSGPGIOKCACGIERNCTDPK 
Y Y CNCDADYKQWRKDAG FLS YKDHLPVSOVWGDTDROGSEAKL 
S VGPLRCOGDRNYWNAAS FPNPS S YLHFST FOGETSADI SFYFK 
TLTPWGVFLENMGKEDFI KLELKSATEVS FS FDVGNG PVEI WR 
S PTPLNDDQWHRVT AERNVK0ASU5VDRLPO0 1 RKAPTEGHTRL 
ELYSOLFVGGAGGQOGFLGCIRSLRMNGVTLDLEERAKVTSGFI 
SGCSGHCTSYGTNCENGGKCLERYHGYSCDCSNTAYDGTFCNKD 
VGAFFEEGMWliRYNFQAPATKARDSSSRVDNAPDOQNSHPDLAQ 
EEIRFSFS TTKAPC3 LLYI SS FTTDFLAVLVKPTGSLQI RYNLG 
GTREPYNI DVDHRNMANGQPHSVNITRHEKTIFLKLDHYPS VS Y 
HLPSSSDTLFNS PKS LFLGKV I ETG KI DQE 3 H K YNTPG FTGCLS 
RVQFNQlAPLKAALRQTOASAHVHlQGELVESNCGASPLTLiSPM 

A\ Wl FTP S LCTP \VLP * SR* HVS PHKGTLP I PNEAKGAGSRQK 
KPGRRPSMNI^DPPTSQRPIDESKKEWPHLRGGYLAMG 


6791 


1802 


1193 


TGHSGAKGEKGDKGDLGPRGERGQHGPKGEKGYPGIPPEL/PGW 
SAW* S WliTAAS TKVQA I LLPQ P LE * LGL0 1 AFWAS LATH FSKQ 
NSGIIFSSVETNIGNFFDVMTGRFGAPVSGVYFFTFSMMKHEDV 
EEVYVYI^HNGrm/FSMYSYEMKGKSDTSSNHAVLKLAKGDEVW 
LRMGNGALHGDHQRFSTFAGFLLFETK 


6792 


33 


1073 


V RHT^3V^GVDMY t»F SLGS E S PKGA1 GH I VSTE KT I LAVERN KVLL 
PPLWNRTFSWGFDDFS CCLGSYGSDKVLMTFENLAAWGR CLCAV 
CPSPTTIVTSGTSTWCVWELSMTKGRPRGLRLRQALYGHTQAV 
TCLAAS VTFSLLVSGSODCTCILWDLDHLTKVTR LPAHREG I SA 
I TI SDVSGTIVSCAGAHLSLWNVNGQPLASI TTAWGPEGAITCC 
CLM EG PAWBTS 0 1 1 1 TG S QDGMVR VWKT/ VG CED VCSWTAS RRG 
APGSAS KPKRPQVGEEPGLESRAGR * HCFDR E AQQNO P \ P VTAL 
AVS RNHTKLLVGDERGR I FCWSADG* EERGS RGSGTTVPG 


6793 


234C 


605 


GUKEANYWGSLTCAGTVSLGLDAEGQEVFVPFSAVLPKIVAFKD 
LVFDGWDISSLFLLAEAMRRAKVLDWGLQEOLKPHMEALRPRFSV 
YIPEFIAANOSARADNLIPGSRAOQLEOIRRDIRDFRSSAGL.DK 
VI VLWTANTERFCEVI PGLNDTAENLLRT I E LG LIE V S?S TLF A V 

ic] LEG CAFLNG 9 PONTLV P GALELAWOP R V F VGGD D F KSGOTK 
VKS VLVDFLI GSGLKTMS I VSYNHLGNNDGEN LS APLC FRSKEV 
SKSNWDDMV0SNPVLYTPGEEPDHCW1 KYVP YVGDS KRALDE 
YTSELMLGGTNTLVLHNTCEDSLLAAPIMLDLALLTELCORVSF 
CTDMDFEPQTFHPVLSLLSFLFKAPLVPPGSPVVNALFRQRSC1 
EN 1 LRACVGLPPCNHMLLEHKMERPGPSLKRVGPVAATY PMLNK 
KG?VPAATNGCTGDANGKLQEEPPMPTT*GPGHTVSRLFLPAAP 
HDPTLKAPTNKGRCHFSPPSTWGSWGL 


6794 


165 


1349 


CDVXRKPEASAH*EKPGPPSRPGVRGGRERAGGRGSHGARSCR\ 

L.rJ\rrf\rf\l J k'&iJni?Vt^E'rl\2Z X 1 U X Iwr Lii\ro£>J\. J I J. V-A-Ijr 

NL PTDI TEEDFKRLFER YGEFS E VFI NRDRG FGFIRLE SRTLAE 
I AKAELDGTI LKSRPLRIRFATHGAALTVKNLS P WSNELLEQA 
FSQFGP VEKA WWDDRGRATGKGFVEFAAK P PAR KALER CGDG 
AFLLTTTPRPVIVEPMEOFD3EDGLPEKLMCKTQQYHKERE0PP 
K FAQ PGTFE FEY AS RWKALDEMEKQQREQVDRN I REAKE KLEAE 

HEEEHRRREEEMIRHREOEELRRQQEGFKPNYMENYVCHFLR 


6795 


1740 


1010 


GPRRQTQVRD1IELDS F* DWAAQETDCAQNSGERL* KGV/ LENFS 
TKS KS AVKI SLDLLSNPLCEODODLLNMVTALDTAMKRMDAFNC 
EKV7CQI0KTVIEPLKKFGSVFPSLNMAVKRRE0ALQDYRRLQAK 

FY GS R LDY FO PS FE S L I RAQ WYYSEMH K I FG D L S HQLDQ PGHS 
DEOR E RENEAKLS ELRALS I VADD 


6796 


48 


683 


GKE I QI PTI KLAWLLFGLE* PVGALGKGWSP* *SHVALGQLGW 
LTRAVRSSWRWELCVSAQEWSORSA*SSPSFVGACPSLNPPET 
SVQEGRDCWQR*LPRLFSALVGQPGCWPQGAPPERCV* PGRCKW 
HL0S0VLR*ERRRCCRCLPRFA*GWRRRH0RLGLGIHPAPLGST 
SPPHPEGNSCOCRR*GWAAELRLPSSWL*GKLGC* 


6797 


1620 


211 


TEKMTPSQPTRGSSCTRPSSMLWTSTWRCLTCKWAGMRKSWGV 
TLGPMAQGLLSASGTTTEATWTRPTTHLTLIRKWLLTASRVDPF 
HRPPPPPSDDLTLLESSSSYKNL/DAQIPO/DWSMSPSTSG*RP 

1 TCD1CCTMDCTJ'?! TDCLQ* CPT.T*TTTH'T , Vf2fJC:PCIVWPPPP*r < IP c ; 

VST P VS SSTETTASGS CLTWWSSS PAPCPS S SAPAHS FEASCCK 
TSLWGS CGGSGDGSSACGSGWNLSMAGTSCSS PAMCS FSRAPS * 
RSASRPRTWRATTSAASSWAPSRCWCGWA*SAT*PSSTTTISSS 
?HCGWPCPASCASAAAWLSSTWATASVAGSCWGPIM*SS7\HSPW 
CLSACSRSSMGTTCL*RSPP\SGASRAAAAWCGSSPSSTFTPSS 
ASSSTWCSASSSRSSPAPTTPSSIPAAQAQRRASCRPTSHSART 
APPPAS S AAGAARPAAFSAAAEGTPRRS I RCK 


6798 


3894 


3696 


STaSKESLESWLNKATNPSNROEDWEYIIGFCDOINKELEG*VS 
ALWG0LRGSGLGRGTTMAKEG0PGSPRLSALECVLLVPQ\P0IA 
VR LLAHKI OSPQEWEALQALTY LGDRVSEKVKTKVI ELLYSWTM 
ALPEEAKIKDAYKMLKRQGIVOSDPPIPVDRTLIPSPPPRPKNP 
VFDDEEK^KLIAKLLKSKNPDDLQEAIJKLIKSMVREDEARIQKV 
TKRLHTLEEVNNNVRLLSEMLLHYSQEDSSDGDRELMKELFDQC 
ENKRRTLFK1ASETEDNDNSLGDILQASDNLSRVINSYKTIIEG 
OV INGE VATLTLPDSEGNS0CSNQGTL3 DLAELDTTNS LSSVLA 
PAPTFPSSGIPILPPPPOASGPPRSRSSSOAEATLGPSSTSNAL 
SWLDEELLCLGLADPAPNVPPKESAGNSOWKLLQREOSDLDFFS 
PRPGTAACGASDAPLLQPSAPSSSSSQAPLPPPFPAPWPASVP 
A PS AGS SL FS TGVA PALA PKVE PAVPGHHGLALGNS AL HKLDAL 
DOLLEEAKVTSGLVKPTTSPLIPTTTPARPLLPFSTGPGSPLFQ 
PLSFQSCX3SPPXGPELSLASIKVPLES3KPSSALPVTAYDKNGF 
RILFHFAKECPPGRPDVLWWSMLNTAPLPVKSIVLQAAVPXS 
M3CVKL0PPSGTELSPFSPIQPPAAITOVMLLANPLKEKVRLRYK 
LTFALGEOLSTEVGEVDQF PPVEQWGNL 


6799 


3694 


1696 


STISWESLESWLNKATNPSNRQEDWEYIIGFCDQINKELEG*VS 
ALWGQLRGSGLGRGTTMAKEGQPGSPRLSALECVLLVPQ\PQIA 
VRLLAKKIQSPOEWEALOALTYLGDRVSEKVKTKVIELLYSWTrt 
ALPEEAKI KDAYHMLKRQGIVQSDPPI PVDRTL.1PSPPPRPKNP 
VFDDEEKSKLLAKLLKSKNPDDLQEANKLIKSMVREDEARIQKV 
7KR LHTLE E VNNNVRLLS EMLLH YSQEDS S DG DRELM KE LFDQC 
ENKRRTLFKLASETEDNDNSLGDILQASDKLSRV1NSYKTIIEG 
QVINGEVATLTLPDSEGNSOCSNQGTLIDLAELDTTNSLSSVLA 
PAPTPPSSGIPILPPPPQASGPPRSRSSSQAEATLGPSSTSNAL 
SWLDEELLCLGLADPAPNVPPKESAGNSQWHLLQREQSDIjDFFS 
PRPGTAACGASDAPLLQPSAPSSSSSQAPLPPPFPAPWPASVP 
APSAGSSLFSTGVAPALAPKVE P A VPGH HGLALGNS A 1 «H H LDAL 
DQLLEEAKVTSGLVKPTTSPLIPTTTPARPLLPFSTGPGSPLFQ 
PLS FQSQG SPPKG PELS LAS I HVPLES 3 KPS S ALPVTAYDKNGF 
RILFHFAKECPPGRPDVLWWSMLNTAPLPVKSIVLQAAVPKS 
MKVKLQPPSGTELSPFSPIOPPAAITQWiLLANPLKEKVRLRYK 
LTFALGEQLSTEVGEVDQFPPVEQWGNL 


6800 


404 


1646 


RRSPSTGLS?VPCPSSPSLSDYSIPWSLLLSGTIAWATPGK*AG 
* PQAW*LGLAPA1AF1 / GLTRGRKQN KEKMAEGGSGDVDDAGDC 
SGARYKDWSDDDDDSNESKS IWY PPWAR IGTEAGTRARARARA 
RATRARRAVQKRASPNSDDTVLS PQELQKVLCLVEMS EKPY ILE 
AALIALGNNAAYAFNRDI IRDLGGLPI VAKI LNTRDPI VKEKAL 
IVLNNLSVNAENORRLKVYMNOVCDDTITSRLNSSVOLAGLRLL 
TNMTVTNE YQHMLAMS I SDFFRLFSAGNEETKLQVLKLLLNLAE 
NPAMTRELLRAQVPSSLG\SLFKKKENKEV1LKLLVJFENINDN 
FKWEENEPTQNOFGEGSLFFFLKEFOVCADKVLGIESHHDFLVK 
VKVGKFMAKLAEHMFPKSQE 


6801 


- 


1755 


SAEEFESQQASVTMHDVDAESFEVLVDVCYTGRVSLSEANVERL 
YAAS DMLOLEYVREACAS FLAR RLDLTNCTA 1 LKFADAFGHRKL 
RSOAQSYIAQNFKOLSHMGSIREETLADLTLAOLLAVLRLDSLD 
VESEQTVCHVAVOWLEAAPKERGPSAAEVFKCVRWMHFTEEDQD 
YLEGLLTKPIVKKYCLDVIEGALQMRYGDLLYKSLVPVPNSSSS 
/R*QQQLSCICSRKSTPETGYVCOGDGDLLWTPQRSLS\RYDPY 
SGDI YTMPS PLTS FAKTKTVTSSAVGVS PDH DI YLAAQPRKDLW 
VYKPAQNS WQQLADRLLCREGMDVAYLNGY I YI LGGRDP ITGVK 
LKEVECY S VQRNQWALVAPVPHS FYS FEL I WQNYLYAVNSKRM 
LCYDPSKNMWLNCASLKRSDFOEACVFNDEI YCICD1 PVMKVYN 
PARGEWRR I SN I PLDS ETHNYQ 3 VNHDQKLLL ITSTTPQWKKNR 
VTVYEYDTREDQWINlGTMIiGLLQFDSGFICLCARVYPSCLEPG 
QSFITEEDDARSESSTEWDLDGFSELDSESGSSSSFSDDEVWVQ 
VAP0RNAODQQGSL 


6802 


157 


1341 


ETFPLFFFLLSKTPGKTASMAHFVQGTSRMIAAESSTEHKECAE 
PS TR KNLMN SLEQK1 R CLEKQR KELLEVNQQWDQQFRS MKELYE 
R KVAELKTKLDAAER FLS TRE KDPHQRQR KDDRQREDDRQRDLT 
RDRLOREEKEKERLNEELHELKEENKLLKGKl^TLANKEKEHYEC 
EI KRLNKALQDALNI KCSFSEDCLRKSRVEFCHEEMRTEMEVLK 
QOV0IYEEDFKKERSDRERLNQEKEELCO3NETSQSOLNRLNSQ 
IKACQMEKEKLEKCLKQMYCPPCNCGLVFHLODPWVPTGPGAVQ 
KQREHPPDYQWYALDQLPPDVQHKAN/DWCLAPPPVCCQAG/PR 
TPGLK*SSCLWLPKC*NFRFILSKESPSVEVHTNRERQQATRER 
G 


6803 


i 


2203 


KLSGRPYRHMGVLGTSKLYDIRKTIFTFTPQFIDQQQFYLALDN 
KI-lIVEMLRTDLSyLCSRWKMTGOPTITFPISHSKLDEDGTSLNS 
S I LAALR KMQDG Y FGGAR VQTG KL»S E FLTTS CCTHLS FMDPG P E 
GKLYSEDYDDNYDYLESGNV3MNDYDSTSKARCGDEVARYLDKLL 
AHTAPHPKLAPTSQKGGLDRFOAAVOTTCDLMSLVTKAKELHVQ 
jrVHKYLPTKLFOASRPSFNLLDSPHPRQENOVFSVRVEIHLFRD 
OSGEVDFKALVLOLKETSSbQEQADILYMLYTyiKGPDWNTELYN 
ERSATVREbl/TELYGKVGEIRHWGLIRYlSGILRKKVEALDEAC 
TDLLSH0KHLTVGLPPEPREKT1SAPLPYEALT0L1DEASEGDM 
SIS1LT0EIHVYLAMYMRTQPGLFAEKFRLRIGLI1QVMATELA 
HSLRCSAEEATEGLMNLSPSAMKNLLHKILSGKEFGVERK/SVR 
FTDSNVSPA3SIHEJGAVGATKTERTGIM0LKSEIKQVEFRRLS 
ISASSQSPGTSMTPSSGSFPSAYDOjQSSKDSRQGQWQRRRRLDG 
ALNRVPVGFY0KVWKVLOKCHGLSVEGFVLPSSTTREMTPGEIK 
FSVHVES \VLNVLLRPEYROLLVEAILVLTMLADIEIHS IGSI I 
AVEKI VK2 AJTOLFLQEQKTLGP \DDTMLAKDPASG\ 1 CTLR \YD 
SAPSGRFGTMTYLS\RAA\ATYVQEFLP\HS I CAMQ 


6804 




951 


GSPGKKEEKAKNKESLCMENSSNSSSDEDEEETKAKMTPTKKYN 
GLEEKRKSLRTTGFYSGFSEVAEKRIKLLKNSDERLQNSRAKDR 
KDVWSSICGQWPKKTLKELFSDSDTEAAASPPHPAPEEGVAEES 
LQTVAEEESCSPSVELEKPPPVNVDSKP1EEKTVE\ 7 NDRKAEFP 
S5G£NFSA*IPLPYLHLNRLHQSL*0KGSR0O£SVTVSEPLAPN 
CEEVRS I KS ETDST 3 EVDS VAGELQDLQSERE * LAS RF * COCb'L 
K0**SARTRTS*KSLYRSEKSERCSGRRKF3KKAEKKP*SNSGK 
0QKEGKRHK 


6805 


1539 


206 


RQPDLKYFGKSFDVSVSESSSLLSNDLPKFADGIKARNRNQNYL 
VPS P VLR 3 LDHTAFSTEKS ADI V 1 CDEECDS PES VNQQTQEESP 
J EVHTAEDVP I AVEVHAI SEDYDI ETENNSSESLQDQTDEEPPA 
KLCKILDKSOA^NVTAQOKWPLLRANSSGLYKCELCEFNSKYFS 
DLKQHMILKK KRTDSNVCRVCKE S FSTNMLLI EHAXLHEEDFY I 
CKYCDYKTV1FENLS0HJADTHFSDHL.YWCE0CDV0FSSSSELY 
LHFQEHSCDEOYLCOFCEHETNDPEDLHSHWKEHACKLIELSD 
KYNNG EHGOYSLLS Kl TFDKCKNFFVCOVCGFR SRLHTNVNRHV 
A I EHTKI FPKVCDDCGKGFS SKLE\ IAKHLNSHLSEGI YLCQYW 
E YS TGQI EDLKI HLDFKHS ADLPHKCS DCLMRFGNERELI SKLP 
VHETT 


6806 


272 


3794 


VALCFPNSDPVMFMDAFYGCLLAELGPVPIEVPbTRKDAGSQQV 
GFLLGS CGVFLMjTTDACQXGLPKAQTGEVRAFKG WP PLS V3LV1 
DGKHLAKPPKDWH PLAQDTGTGTAY I S YKTE KEGSTVGVTVSHA 
SLLAOCRALTOACGYSEAETLT^LDFKRDAGLWHGVLTSVKNR 
MHVVS VPYALMKANPLS W 1 QKVC FY KARAALVK S RDMHWSLIAQ 
RGQRDVSLSSLRMLIVADGANPWSISSCDAFLNVFQSRGLRPEV 
3 CPCASSPEALTVAIRRPPDLGGPP PRKAVLSKNGLSYGVI RVD 
TEEKLSVLTVQDVGQVMPGANVCVVKIjEGTFYLCKTDEVGEICV 

SSSATGTA YYGLLGITKNVFEAVPVTTGGAPIFDRFFTRTGLLG 
F 1 G P DHLVF 3 VG KL DG LM VTG VR RHNADD WATALAVE PMKFVY 
RGRIAVFSVTVLHDDRIVLVAEQRPDASEEDSFQWMSRVLQAID 
S I HO VGVY C LALV P AN TLP KAPLGG IHISETKQR FLEGTLH PCN 
VLMCPHTCVTNLPKPRQKOPEVGPASMIVGNLVAGKRIAQASGR 

ELAHLEDSDQARKFLFLADVIjQWRAHTTPDHPLFIjLLNAKGTVT 

STATCVQLHKRAERVAAALMEKGRLSVGDHVALVYPPGVDLIAA 
FYGCLYCGCVPVTVRPPHPONLGTTLPTVKI^IVEVSKSACVLTT 
OAVTRLLRS KEAAAAVDIRTWPTI LDTDDI P KKK I AS VFRPPSP 
DVLAYLDFSVSTTG I LAG VKMSHAATSALCRS J KLQCELYPSRQ 
IAICLDPYCGLGFALWCLCSVYSGHOSVLVPPLELESNVSLWLS 
AVSOYKARVTFCCYSVMEMCTKGLGAQTGVLRMKGVNLSCVRTC 
KWAEERPVRIALTOSFSKLFKDLGLPARAVSTTFGCRVNVAIC 
LQGTAGPDPTTVYVDMRALRHDRVRLVKKGSPHSLPLKBSGKIL 
PG VKV 1 3 AH TETKG PLGDSHLGE I WVS S PHNATGY YT V YG E EAI> 
HADHFSARLS r GDTQTI WARTG YbGFLRRTELTDASGGRHDALY 
WGSLDETLELRGMRYHPID3 ETS VIRAHRS IAECAVFTWTNLL 
WWELDGLEODALDliVALVTNWLEEKYLWGVW J VDPGVI P 


6807 


1444 


606 


VGKDTVHAf»lFTCFPKCLGFSPP\WTVSPRSEESHTTTVSGGNG 
SVFOAGP0LOALANLEAKRGS1 GAALSSRDVSGLPVYAQSGEPR 
RLTQAQVAAFPGENALEHSSDODTWDSLRSPGFCSPLSSGGGAE 
SLPPGGPGHAEAGHLGKVCDFKLNHQQPSPTSVLPTEVAAPPLE 
K I L S VDS VA VDCAYRT VPKPG P 0 PG PHG S LLTEGCLR S LSGDLN 
RFPCGMEVHSGORELESWAVGEAMA\LKFPMGAMSYCLRDRSR 
FLFRLPMGLSCPLQVQ 


6808 


2063 


737 


GVGSGAASALARSRPLASRIiSSRRRTRAPRSGAMQRliAMDLRML 
SRELSIiYLEHQVRVGFFGSGVGLSLILGFSVAYAFYYLSSIAKK 
PQLVTGGES FSR FLQDHCPW7ETYYPTVWCWEGRGQTLLRPF\ 
j.Ti?KPPVyY RNELIKTADGGyaSljDwr DNDNb lUiMDASTRPTl 
LLLPGLTGTS KES Yl LHMI HL£ EELG YRCWFNNRGVAGENLLT 
PRTYC CANT E DLET VI HH VFJS L Y PS AP FbAAG VSMGGMLLLNYL 
GKIGSKTPLMAAATFSVGWNTFACSESLEKPLNWLLFKYTLTTC 
LOSSVNKHRHMFVKOVDMDHVKKAKS I REFDKRFTSVMFGYQTI 
DDYYTDAS PS PRLKSVG I PVLCLNS VDDVFS PSHA1 P 3 ETAKQN 
PNVALVLTS YGGH1 GFLEG I W PROS T YMDR VFKQFVQAM VEHGH 
EliS 


6809 


939 


65 


DYSGQTPVPTEHGMTLYTPAOTHPEQPGSEASTQPIAGTQTVPQ 
TDEAAQTDSQPbHPSDPTEKQQPKRLHVSNIPFRFRDPDLRQMF 
GQFGKI LDVE 1 1 FNERGS KG FG F VT FETSS DADRARE KLNG TI V 
EGRKIEVNKATARVMTNKKTGNPYTNGWKLNPWGAVYGP2FYA 
VTGFPYPTTGTAVAYRGAHLRGRGRAVYNTFRAAPPP PP I PTYG 
AVVYQDGFYGAEI \LEAT0PTB1-LS PLORRQPTATVTAESTQLP 


6810 


S3S 


6E 


DYSGQTPVPTEHGMTLYTPAQTHPEQPGSEASTQPIAGTQTVPQ 
TDEAAQTDSQPLHPSDPTEKQQPKRLHVSNIPFRFRDPDLRQMF 
GQFGKI LDVE 1 1 FNERGS KG FGFVTFETSSDADRAREKLNGTI V 
EGRKI E VNNATAR VMTNKKTGNP YTNGWKLNPWGAVYG PE FYA 
VTG FP YPTTGTAVAYRGAHLRGRGRAVYNTFRAAPPPPP I PTYG 
AW YQDGFYGAE I \LEAT0PTDTbS PliCRRQPTATVTAES TQLP 
TRT 1 TPSGPRRPTALEPCETFKR FLLGP 


6811 


1522 


658 


DLVTVWSFVDCRVIASTHGH \ KSVTVSWAFDPYTTSVEEGDPME 
FSGSDEDFQDLLHFGRDRADSTQCRLSRRNSTDSRPVSVTYRFG 
SVGQDTQLCLWDLTEDILFPHOPLSRARTHTNVMNATSPPAGSN 
GNSVTTPGNSVPPPLPRSNSLPHSAVSNAGSKSSVMDGAIASGV 
SKFATLSLHDRKERHHEKDHKRNHSMGKISSKSSDKLNLVTKTK 
TDPAKTLGTPLCPRMEDVPIiIjEPLJ ckk i aherltvli fledci 
VTACQEG Fl CTWGRPGKWS FNF 


6812 


4001 


1682 



EDA VFSLDLSTI I QGTWFLNGEELKSNEPEGQVEPGALR YR I EQ 
KGLOHRLIUIAVKHQDSGALVGFSCPGVQDSAALTIQESPVHIL 
SPCDKVSLTFTTSERVVLTCELSRVDFPATWYKDGQKVEESELL 
WKMDGRXHRL1LPEAKVQDSGEFECRTEGVSAFFGVTVQDPPV 
HI\TPREHVFVHAITSECVMLACEV\DR\EDAPVRWYKDGQEVE 
ESDFWLENEGPHRRLVLPATQFSDGGEFQCVAGDECAYFTVTI 
TDVSSWIVYPSGKVYVAAVRLERWLTCEIX^PV?AEVRWTICDGE 
EWESPALLLOKEDTVRRLVLPAVQLEDSGEYLCEIDDESASFT 

VTVTEPPVR3IYPRDEVTL3AVTLECVVLMCELSREDAPVRWYK 
DGLEVEESEALVLERDGPRCRLVLjPAAQPEDGGEFVCDAGDDSA 
FFTVTVTE P PVQ FLALETTP S P LCVAPG E PWLS CE1>S RAGAPV 
WSHNGRPV0EGEGLELKA2GPRRVLCIQAAGFAKAGLYTCQSG 
AAPGAPSLSFTVQVAEPPVRWAPEAAQTRVRSTPGGDLELVVK 
LSGPGGPVKWYKDGERLASQGKVOLEQAGARQVLRVOGARSGDA 
GEYL CDAPQDS R I FL VSV EE PLL VK L VSDLTP L TVH EGDDATFR 
CEVSPPDADVTWLRNGAWTPGPCRQSCCSYGGCRMCGORKART 
CVS KWRQAEWVC/RG PCAG CE VG S PC PTTLACP ft" PRMG TS T&SSS 
MVSYWPTRAPTAARATTIAPWPGSA 


6813 


9 


83 i 


SSTQQRPGVPAGPR PLDGYLGVADH K PLKMHCRDCALVTSSGHL 
LHSRQGSOIDQTECV3RMNDAPTRGYGRDVGNRTSLRVIAHSSI 
QRILRNRHDLLMVSQGTVFI FWG PSSYMRRDG KGQVYNNhHLLS 
QVLPRLKAFMITRHKMLQFDELFKOETGQ\NRKISNTWLSTGWF 
ITITIALELCDRI^vTGMGPPDFCRDPNHPSVPYKYYEPFGPDEC 
TMYLSHERGRKGSHHRFITEKRVFKJ^WARTFN] KFFQPDWKPES 
LAINHPENKPVF 


6814 


3 


73^ 


KFRROEAN /AR E RNRMHGLNDALPNLR KWPC Y S KTQKLS K I ET 
LRLAKNYIWALSEILRJGKRPDLLTFVQNLCKGLSOFTTNLVAG 
CLQLNARS FLMGQGG E AAHHTRS PYSTFYPPYHS PELTTP PGHG 
TLDNS KSMKPYN YCSAYES FYES TS PECAS PQFEGPLSPPPINY 
NGIFSLKQEETLDYGKNYNYGMKYCAVPPRGPLGQGAMFRLPTD 
SH FP YDLMLR S QSL TMC/DELNA V FHN 


6815 


906 


55 > 


QGLDPASQTKWELLKDGSGRRGDRRSSRDMAGGAGPRSESDLE 
DVGPTAEWNGDGSGSLRRSGSFGKLRDALRRSSEMLVKKLOGGT 
PQEPPNPRMKRASSliNFLNKSVEEPTQPGG 


6816 


1 


sc:- 


NLLKTHKF\LLGQDEDSLHSVPVAOMGNYOEYLKTLASPLREID 
PDQPKRLHTFGNPFKODKKGMM1 DEADEFVAG PQNKVKRPGEPN 
SPMSSKRRRSMSLLLRKPOTPPTVTNHVGGKGPPSASWFPSYPN 
LIKPTLVHTDATI I HDGHEEKMENGQI TPDGFLS XSAPSELINM 
TGDLM P PNQVDS LS DDFTS LS KDGL I QKPGSNAFVGGAKN CS LS 
VDD0KDPVASTLGAMPNTLQITPA-MAOGINAD3 KHQLMKEVRKF 
GRSK 


6817 


172 


345-, 


LiGMMDSPKIGNGLPVJGPGTDIG^SSLHMVGYlGKNFDSAKVPS 
DEYCPACKEKGKLKALKTYRISFOESI FLCEDLQCI YPLGSKSL 
NNLISPDLEECHTPHKPQKRKSLESSYKDSLLLANSKKTRNYIA 
IDGGKVLNSKHNGEVYDETSSNLFDSSGQONPIRTADSLERNEI 
LEADTVDMATTKDPATVDVSGTGR P S PQNEGCTS KLEKPLESKC 
TSFP0ALCV0WKNAYALCWLDC2LSALVHSEELKNTVTGLCSKE 
ESI FWRL»LTKYNQANTLLYTSQI,SG VKDGDCK KLTS EI FAE I ET 
CLNEVRDEIF1SL0P0LRCTLGDKESPVFAFPLLLKLETHIEKL 
FLYSFSWDFECS0CGH0YQNR«MK£LVTFTNV3PEWHPLNAAHF 
GPCNNCN S KS 0 1 R KMVLE KVS P I FM LH FVEGL P ONDLQHY AFHF 
EGCLYQ I TS V2 C Y RANNHF I TW I LDADGS WLEC D DLKGP CS ERH 
KKFBVP AS EIKIVIWERKI SQVTDKE AACLPLX KTNDQHALSNE 
KPVSLTSCSVGDAASAETASVTHPKDISVAPRTLSQDTAVTHGD 
HLLSGPKGLVDN I LPLTLEETI OKTAS VSQLNS EAFL\LENKPV 
AENTG I LKTNTLLS QES LMAS S VS A P CNEKLI ODQFVD I S F PSO 
VVOTNMQSV0LNTEDTWTKSVNKTDATGLIOGVKSVEIEKDAQ 
LKQFLTPKTEOLKPERVTSQVSNXKKKETTADSOTTTSKSLQNQ 
SLKENOKKPFVGSWKGLISRGASFMPLCVSAKKRNTITDLQPS 
VKGVNNFGGFKTKGINQKASHVSKKARKSASKFPPISKPPAGPP 
SSNGTAAHPHAHAASEVLEKSGSTS CG AOLNHS S YGNGI SSANH 
EDLVEG0IHKLRLKLRKKLKAEIOCKIAALMSSP0SRTVRSENLE 
0VP0DGS PNDCES I EDLLNELP YP I DJANESAC7TVPGVSLYS S 
QTHEE 1 1AELLS PTPVSTELS ENGEGDFRYLGMGDSHIPPPVPS 
E FNDVS ONTHLRODHN YCS PTK KN F C E VQPDS LTNNACVRTLNL 
ES PKKTD I FDEFFS SS ALNAIANDTLDLPHFDE YLFENY 


6818 


2 


240 


RGFJDKVLWT/LSGAVK\CVQFSR2SPDGEEGYPGELKVWVTYTL 
DGGE/LHS/ATTEHKP/VQATPVNLT\TILTSTW0ARLPOl j


6819


1


96 j 


GiPCTEMGNFDNANVTGEIEFAIHYCFKTHSLEICIKACKNLAV 
GEEKKKKCNPYVKTYLLPDRSSOGKRKTGVQRNTVDPTrOETLK 
YQVAPAOLVTRQLQVSVWHLGTLARRVFLGEVIIPLATWDFEDS 
TTQS FR WK P LRAKAD K Y EDS VPQ S NGE LTVRAKLVLP S RPR KLQ 
EAQEGTDOPSLKGQLCLWLGAKiaPVRPDGTLNSFVKGCLTLP 
DQOKLRLKS PVLRKOACPQWKHS FVFSGVTPAQIiRQSSLELTWJ 
DOALFGMNDRLLGGT\ R T GSKGDTAVGGDACSO^KT.nwOKUT.^^ 
PNLWTDMT LVLH 


6820


1014 


340 


GDMVYIVGirVPPGFFEKTONKAWFREGFNEKYLKWRKHHRVlA 
GQFFGHKKTDSFRMLYDDAGVPISAMF3TPGVTPWKTTLPGVVN 
GANNPA 3 RVFEYDKATLSLKDMVTYFMNLSQANAOGTPR WELE Y 
OLT EA YG V PDA ^ AHSMHT VT ,DP 1 AC? DO c! TT .OR YYUYN ^V^Y^Afi 

VCDEACSMOHVCAMRQVDIDAYTTCLYASGTTPVPQLPLLLMAL 
LGLCT 


6821 


1088 


5ie 


EFDIYR/EVGGEFVPVTRDDSSNGFPRTQHGPSPTVHPIQSPQN 
RFCVLTLDPETLPA3 ATTLI DVLF YSHSTPKEAASSS PE PSS IT 

GQPLGFDECGI VAQ IAG PLAAADI SAYYISTFNFDHALVFEDGI 
GS VI EVLORROEGLAE 


6822


1088 


516 


EFDJ YR / E VGGE F VP V TRDDS SNG FPR T QHGPS PTVHP I OS PON 

FFAFSLIEGYI \ S I VMDAETQKKFPSDLLLTSSSGELWRMVRIG 
GQPLGFDECGI VAOIAGPLAAADI SAYYISTFNFDHALVPEDG1 
GSVIEVLQRRQEGLAS 


6823 


654 


223 


PPKLLSRWARMGKGDE 3 V \LSDLNFPGLLHliPWGPWRS VQTAC 
GIPQLLEAVLKLLPLDTYVESPAAVMELVPSDKERGLQTPWTE 

ILRKGVLALNPLL 


6824 


858 


104 


LLLAOR KGWG \ CCFFSIAVS VKMNVLLFAPGLLFLLLTQ FGFRG 
ALPKLGICAGLQWLGLPFLLENPSGYLSRSFDLGRQFLFHWTV 
NWRFLPEALFLHR/iFHLALLTAHLTLLLLFALCRWHRTGES I LS 
LLRDPSKRKVPPOPLTPNQIVSTLFTSNFIGICFSRSLHYQFYV 
WYFHTLPYLLVJAf-lPARWLTHLLRLLVLGLIELSWNTYPSTSCSS 
AALH I CHAVI LLQLWLGPOPFPKSTQKS KKAH 


6825


3 


1173 


SSGEFGL0ASD1MWTISDTGWILI1LCSI^4EPWAL,GACTFVHLL 
PKFnPLVTT,KT^,^^YP^K^MMGAPIVYR^lI 1 liOOD^ J < ;c;Yvr•pLI]r^ 

l XvL LTD v _l UJ\ i JJO O J r 1 AOi'irJunr iv; X\ * J \ 91 i W^X<o O A X\ I i n 

NCLAGGESLLPETLENWRAQTGLDIREFYGQTETGLTCMVSKTM 
KI KPG YMGTAASCYDVQI IDDKGNVLPPGTEGDIG I RVKP I RPI 
G I FSG Y VDN F DKTAAN I RGDFWLLGDR G I KDEDGY FQFMG RADD 
IINSSGYR3 GPSE VENALMEH PAWETAVISSPDPVRGE WKAF 
VI LALOFLSKDPEQLTKELQQHVKSVTAPYKYPR KI EFVbNLPK 
TVTGKIORA\KLRDKEWKMSGKAPCAVRHLRDIHLDSPLliSLSF 
PFGPLALPMDGYGDSLWEEHEYKFCLALVISTKLYHVRC 


6826 


2304 


954 


LKTESFKPW/VN1 ALA FHLLG ERAS PNSFWQPY1QTL PRE YDTP 
LYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIOTHPHA 
NKLPLKDS FTYEDYRWAVSSVMTRQNOI PTEDGSRVTLALI PLW 
DMCNHTNG b I TTG YNLEDDR CECVALQD FRAGEQ I Y I FYGTRSN 
AEFVIHSGFFFDKNSHDRVKIKLGVSKSDRLYAMKAEVLARAGI 
PTS S V F ALH FT E P P I S ACLLAFLR V FCMTEE ELKEHLLGD S A I D 
RI FTLGNS EFFVS W DNEVKLWTFLEDRAS LLLKTYKTTI E EDKS 
VLKNHDLSVRAKr4AIKLRLGEKEILEKAVKSAAVNREYYKQ0ME 
EKAPLPKYEESNLGLLESSVGDSRLPLVLRNLEEEAGVODALNI 
RSAIS KAKATENGL VNGEKS I PNGTRSENESLNOES KRAVEDAK 
GSSSDSTAGVKE 


6827 


1 ! 779 

i 

j 


SS WE FGLSVLGGLFLLFVLENMLGLLRHRGLRPRCCRRKRRNL 
ETRNLDPENGSGMALOPLQAAPEPGAQGOREKNSOHPPALAPPG 
H0GHSHGH0GGTD3TWMVLLGDGLHNLTDGLAIGAAFSDGFSSG 
LSTTLAVFCKELPHELGDFAMLLQSGLSFRRLLLLSLVSGALGL 
GGAVLGVGLSLGPVFLTPWVFGVTAGVFLYVALVLMLPALFPSS 
GAP AYA\HVLL(X3LG LLLGGCLMLAI tlleerllpvtteg 


6828 


■a 


I6b4 


KSQHG/WILOLMHSCKEGYVKDIjKGNPGLHRAMLDLDNGTRFSE 

i^hlsctaslkrgssfqsgrddtwryktphrvafvekltklvls 
qlpnfwklw i s yvngs lfsetae ksgq1 ers knvrqrokdfkkm 
iqevmhslvkltrgallplsirdgeakoyggwevkcelsgqwia 
ha1 qtvr lthe sltale i pndllqti qd li ldlrvrcvmatlqh 
taesikrlaekedwivdnegltslpcqfeqcivcsloslkgvle 

CKPGEASVFOOPKTQEE VOQLS INI MQVF1 YCLEQLSTKPDADI 
DTTHLSVDVS S PDLFGS I HEDFSLTSEQRIfLI VLSKCCYLERHT 
flniaehfekhntfqg: EKITQVSMASLKELDQRLFENYI ELKAD 
PI VGSLEPGI YAGYFDWKDCLPPTGVRNYLKEALVNI I AVHAEV 
FT I S KELVP R VLS K V 3 EAVSEELSRLMOCVSSrSKNGALQARLK 
I CALRDTVAVY LTPES KSSFKQALEALPQLS SGADKKLLEBLLN 
KFKSSMHLQLTCFQAASSTMMKT 


6829


I 


782 


MRMEAGEAAPPAGAGGRAAGGWGKWVRLNVGGTVFLTTRQTLCR 
EQKSFLSRLCOGEEL0SDRDETGAYLIDRDPTYFGPILNFLRHG 
KLVLDKDMAEEG VLE E AEFYNI GPLI RI I KDRMEEKDY TVTQVP 
PKHVYRVLQCCSEELTO^STMSDGWRFEOLVNIGSSYKYGSED 
0AEFLCWSKELHSTPNGLSSESSRKTKSTEEOLEEO00OEEEV 
EEVEVE0V0VEADAOEK/CCYKPEAPGCEAPDHLOGLGVPI 


6830 


2 


935 


MKPGSVJSNLS I VYRSRDFL»WNKHWDVRIDSKAV?RETLT1>0KQL 
RyRFPELADPDTCYGFRFGHQLDFSTSGALCVALNKAAAGSAYR 
CFKERRVTKAYLALLRGHIOESRVTISHAIGRNSTEGRAHTMCI 
EGS0GCENPKPSLTDLWLE1IGLYAGDPVSKVLLKPLTGRTHQL 
RV\HCSALGHPWGDLTYGEVSGREDRPFRMMLHAFYLRIPTDT 
ECVEVCTPDPFLPSL.DACWSPHTLLOSLD0LVOALRATPDPDPE 
DRGPRPGSPSALLPGPGRPPPPPTKPPETEAORGPCLQWLSEWT 
LEPDS 


6831 




1087 


SLFFGSSTPDftKVAEOEDLETQPSPSVEKAVTVIDPEGTIPTNF 
NVAJEKPADHSLS EVKLKTADEPRGTL VKSGDGQNVKEKSM I LSN 
VEDLQOPKFISEVSREDYGKKEISGDSEEMNINSWTSADGENL 
EIOSYSLIGEKLVMEEAKTIVPPHVTDSKRVOKPAIAPPSKWNI 
SI FKEEPRSDOKQKSLLSFDWDKVPQQPKSASSNFAJSKN ITKE 
SEKPESIILPVEESKGSLIDFSEDRLKKEM0NPTSLK1SEEETK 
LRS YS PTEKKDNLENR \ S YTL\AEKKVLAEKONSV\ APLELRDS 
NETGKTOITLGSRSTELKESKADAMPQHFYQNEDYNERPKIIVG 
SEKEKDEKKKK 


6832 


1809 


412 


MGSGLISGPPCDNSGEALKEPERAQEHSLPNFAGGQHFFEYLLV 
VSLKKKRSEDDYEPI 17YQFPKRENLLRGQQEEEERLLKAI PLF 
CFPDGNEWASLTEYPR ETFS FVLTNVDGSRK I GYCRRLLPAGPG 
PRLPJOnf CI I SCIGCFGLFSKI LDEVEKRH01 SMAVI YPFMQGL 
REAAFPAPGKTVTLK5iFI PDSGTF.FI St»TRPLDSHLEHVDFSSL 
LHCLSFEQILOIFASAVLERKIIFLAEGLSTLSQCIHAAAALLY 

Pr bWAHl I Irv vPESLIlAT. VLLP1 rr r) V(jVyKKryyfc.VMJJi>PMr. 
EVLLVNLCEGTFLMSVGDEKDILPPKLQDDILDSLGQG1KELKT 
AEQINEHVSGPFVQFFVKIVGHYASYIKREANGQGHFOERSFCK 
ALTSKTNRRFVKKFVKT0LFSLFIQEAEKSKNPPAGYFOOKILE 
YEEQKKQ/TETKGKNCE I RAWNKND 


6833 


1 


1125 


PLMTLSQCGG1 PGHGHSKGGHGHGHGLPKGPRVKSTRPGSSDIN 
V APGEQGPDQEETNT LV ANTSN SNGLKLD P ADP EN PR SGDTVEV 
QWGNLVREPDHMELEEDRAGQLNMRGVFLHVLGDALGSVIVVV 
NALVFYFSWKGCSEGDFCVNPCFPDPCKAFVEIINSTHASVYEA 
GPCWVLYLDPTLCWMVCILLyTTYPLLKESALILLQTVPKQlD 
IRNLIKELRNVEGVEEVHEIiHVWOLAGSRIJATAHIKCEDPTSY 
M EV AKT 1 KD V FHNKG 2 HATT 10 P E FAS VGS KS S W PCELACRTQ 
CAliKQCCGTLPQAPSGKDAEKT P AVS IS CLELSNNLE KKPRRTK 
AENIPA\WIEIKN\IPNK\QPFSSL 


6834 


78 


1151 


AG0ERPAP1WRLLK1,PTPSVSRKAEPAKIF1NR*GA*E*RGGLP 
LCGSSASAYGWH* RLTPWSPGGS * S S KA P VTC ARE VI.VAG P 
CSKLVLSG ARG I VGTTVQVLVEAOQPLI-LLFTGVWGLNLRAGEE 
SRAL * LI E E V TOVRDAHLGNAWG CAQCLSQG QVGS ALAKALLE 
AAAA VRDCKEVLTVSGDXQQAEVS VRL * VRDVCVEEAGCVEFGQ 
AHGRPGLALAKGRGGTNEVEEQVO^GVQKLVLSAHECHELVAG 
QQDG EDQAARTRLLQAG AHS VAHGRROGOAP C RPHQE AGVSCHE 
LOOWGDAL * ARE * APQ 1 1 VLLLLED VAQLR7GK KA * DLWDVE 
OLLROL 


6835 


j 


834 


G I PAADR \ EASLELI KLD J SRTFPNLCI FQQGGP YHDMLHS ILG 
AYTCYRPDVGYVQGMS FIAAVL2 LN LDTADA F I A FSN LLNKPCQ 
MAFFRVDKGLMLTY FAAFE VFFEENLPKLFAH FKKNNLTPDI YL 
2DW1 FTL YSKSLPLDLACR I WDVF CR DGEEFLFR TALG I LXJbFE 
D3 1/TKMDF 1 HMAQFLTRLPEDLP AEELFASI ATI QMQSRNKKWA 
QVLTALQKDS REMREG KS VPPTLRLQRE F ALGTNCS PMPRPLCC 
FRLTPGOPRRTDAL 


6836 


i 


850 


MSCGRPPPDVDGMITLKV\DNLTYRTSPDSLRRVFEKYGRVGDV 
Y I PREP.KTKAPRGFAFVRFHDRRDAQDAE AAMDGAELDGRELRV 
OVARYGRRDLPRSRQGRRHAAGPE7^A/RYGRRSRSYGRRSRSPR 
RRHRSRSRGPSCSRSRSRSRYRGSRYSRSFYSR5PYSRSRYSRS 
PYSRSRYRESRYGGSHYSSSGYSKSRYSRYKSSRSHSKSGSSTS 
SRSASTSKSSSARRSKSSSVSRSRSRSRSSSMTRSPPRVSKRKS 
KSRSRSKRPPKSPEEEGQMSS 


6837 




1369 


TDG AAV AGN PGSDY FPGGTAP/ GG PRTRR F \ SGTS S SGS KASG P 
PNPPAQGDGTSLSPNYTLESTSGWDGKPVSGGGGRGRGRRKRDS 
GHVS PGTFFDKYSAAPDSGGAPGVS PGQQGAS GAAVGGS S AGBT 
RGAPTPHEKALTSPSWGKGAELLLGDQPDLIGSIaDGGAKSDSSS 
PNVGE FAS DE VS TS YANEDE VS S S S DM PQALV KAS R S PLVTGSP 
KLPPRGVGAGEHG PKAP PPALGLG a MSNSTS TPDS YGGGGGPGH 
PGTPGLEQVRTPTSSSGAPPPDE 2 KPuEI LCAOI QLQRQQFSIS 
EDQPLGLKGG KKGECAVGA5 GAONG DS E LGS CCS EA VKS AMS T I 
DLDSLMAEKSAAWYMPADKALVDSADDDKTLAPWEKAKPQNPNS 
KEAHDLPANKASASQPGSHLOCLEVHCTDDVGDAKARASVPrWR 
SLKSDI SNRFGTFVAALT 


6838 






bTDTP PPKTHMI HHS1 SDYKATLR CMALGFYPME I TLTWQQPEE 
DQTRDMELVETRPAGDGTFQKWAAWVPSGEE /Q/RYMCHVQHE 
GLPEPLTLRWEQSSQPTIPIVGIVAGLVLLGAWTGAWSAVMC 
RKKNSDRVSYSEAASSDHAQGSDVSLTACKV 


6839 




1195 


AAP AGGG PDP EALS AF PGRHLSG LS W PQ VKRLDALLS EP I P I HG 
RGNFPTLSVQPRQIRAGGPQHPGGAGMHVHRVRLHGSAASHVL 
HPE SGLG Y KDLDLVFRMDLRS EA£ FQLTKAWLACLLDFLPAGV 
SRAK3 TPLTLKEAYVQKLVKVCTDSDRWSL3 SLSNKSGKNVELK 

ESLYGDFTEALEHLRHRVI ATRS PEE I RGGGLLKYCHLLVRGFR 
PR PS TDVRALQR YMCSRFF I DFPDLVEQRR TLE R YLEAHFGGAD 
AARR YACLVTLHRWNESTVCLMNKERRQTLDL I AA1ALQALAE 
0GPAATAA1AWRPPGTDGWPATVNYYVTPV0PLLAHAYPTWLP 
CN 


6840 


4254 


2061 


ELCGDFSVPDVPKSMAWCENSICVGFKRDYYLIRVDGKCSSIKEL 
FPTGKOLEPLVAPIJuOGKVAVGQDDLTVVbNEEGICTQKCALNW 



703 ?VAMEHQPPY3 3AVLPRYVEIRTFEFRLLVQS3ELQRPRFI 
TSGGSN3 1 YVASNHFVWRL3 PVPMATQ3 0QLLQDK0FELALQLA 
EM KDDSDS E KQQQ I HH I KNLYAFNLFCOKR FDESMQV FAKLGTD 
P TH V MGLY P DLl>P 7D YRXQLQ Y PNPL F VI S G AE LE KAHLALI D Y 
LTOKRSOI-'VKKLNDSDHOSSTSPLMEGTFTIKSKKKLLOIIDTT 
LLKC YLKTNVALVAPLLRLEK?CHCHI EESEHVLKKAHKYSELI I 
LY BK KGLH E KALQVIjVDQSK KANS PLKGK ERTVQ Y LQHLGTENL 

K1 rev c\nj\r7 utitt'D'C'T^'^t vi nTM dc\tci dd r>m rT nur tpxi 
tiljj. r o IcVW VLiKJJr rtlJ»jLil\i 1 1 r>Uijf fcv £.£>brKDK Viiljr L>J.fc,N 

FKGlaAlPYLEHIIHVViEETG£RFHNCLlQLYCEKVOGl»MKEYI»L 

SFFAGKTPVPAGEEEGELGEYRQKLLMFLE3SSYYDPGRLICDF 

PFDGLLEERALLLGRI^GKHEOALFIYVHILKDTRKAEEYCHKHY 

DRMKDGNKDWLSLLRMYLSFPSIHCLjGPJKbELLEPKANLQAA 

LOVLSLHHS KLDTTKALNbLF ANTQIND 2 RI FLEKVLEENAQKK 

RFN0VLKKLLHAEFLRV\QEERILHQOVKCIITEEKVCMVCKKK 

I GKSAFARY PNGWVHY FCS\ KE VNPADT 


6841




3206 


TPS"J"J'GTKSNTPTSSVPSAAVTPLNESLQPLGDYGVGSKNSKRA 
RE KR DSRNMEVQVTQEMRNVS 3 GMGSSDEWS DVQD3 1 VS TPELD 
MCFETRLDRTGSSPTQGIVNKAFGINTDSLYHELSTAGSEVIGD 
VDFG 7u) LLG E FSGMG KE VGNLLLENSQLLET KNALNWKNDLI A 
KVDOLSGEOEVLRGELEAAKQAKVKLEN'RaKELEEELKRVKSEA 
1 3 AR RE PKEEAEDVSS YLCTES DKI PKAORRRFTRVEMARVLME 
RN0YKERLMELOEAVRWTEMIRASREHPSVOEKKKST3WQFFSR 
LFSSSSSPPPAKRPYPSGN3HYKSPTTAGFSQRRNHAMCP1SAG 
SRPLEFFPDDDCTSSARREQKREQYRQVREKVRNDDGRLOACGW 
SLPA KYKQLS PNGGCEDTRMKlWPVPVy CRPLVEKDPTMKLWCA 
AGWLSGWRPNEDDAGNGVKPAPGRDPLTCDREGDGEPKSAHTS 
PEKKKAKELPEMDATSSRVWI LTSTLTTSKWI I DANQPGTWD 
QF7\ f CNAHVLCISS3PAASDSDYPPGEMFLDSDVNPEDPGADGV 
LAG 3 TLVGCATRCNVPRSNCSS RGDTPVLEKGOGEVATI ANGKV 
NFSCSTEEATEATEVFDPGPSEPETATLRPGPLTEHVFTDPAPT 
PSSGP0PGSENGPEPDSSSTRPEPEPSGDP1GAGSSAAPTMV3LG 
AQNG W L Y VH S A VAN W K KCLH S J KLKDS V LSL VHVKGR VLVAJjAD 
Wi JUm J V HRGEDGQWDLSN YHL^LGH PhHS 3 RCI^VVYDRVWCG 
YKNKVHVI QPKTMQ I EKS FDAK FRRESQVRCLAW 3 GDGVWVS I R 
LD.9TLRLYHAHTHQKL0DVDI E P YVSKMLG7G KLGFS FVRI TAL 
LVAGS RLWVGTGNGW3 S IPL7ETWLHR&0\LLG\LRANKTSP 
TSGEG \ ARPGG\ 1 3HVYG\DDSSDRAARSF3 PYCSMAQAQLCFH 
GHR D A V KF FVS V PGN VLATbNG S VLDS P AE G PGP AA PAS E VEGQ 
KL.RNYLVLSGGEGY 3 DFRIGlXiEDDETEEG&GDMSQVKPVLSKA 
ER SH 3 1 VWQVSYTPE 


6842 


3 


926 


RCOC IS AT I LTDK0YLERTPLCA1LKQ KAFOQYR I RAKLRS YKP 
RRLFQS VKLHCPKCHLLQEVPK EGDLD 1 3 FODGATKTPDVKLQN 
TSLYES K3 WTTKNQKGRKVAVK FVKNNG 3LPLSNECLLLI EGGT 
LSE3 CKLSNKFNSV3 PVRSGHEPLELLDLSAPFL3QGTVKHYGC 
KQWS7* RS I QNLNSLVDKTSW3 r SSVAEALG3 VPLQYVFVM7FT 
LDDG TG VLEA YLM DS D KF FQ 3 PAS EVLM DDDLQKS VDM I MDMFC 
P ?G I K 1 DAY PWLEC F 3 KS YNVT tfGTDNQ 3 C Y Q I FDTT VAEDVI 


6843


2 


851 


NHR K V* h S G AKR YE CNECGKS FA Y TSS L I KH3 R 3HTG ERP YE CS E 
CGRS FAEHSSLI KHLRVHTGER ? YECVECGKSFRRSSSLLQHQR 
VHTREF. P YECSECGKS FSLRSKL.I HKORVHTGERHECGOCGKS F 
SRKSSLI3KLRVHTGERPYECSDCGKSFAEK T SSL3KHLRVHTGE 
RPYEC1DCGKSFRHSSSFRRH0RVHTGMRPVK*SKFWKFSCPGF 
LLLOG0RVHTGSRCYECDKWG 1 FFS *NASFFT* KSAPTEEVPFE 
CNE CE KA FS PLSLVTTI FT 


6844 


244 


642 


EHOIAGFELRKTOTSMSLGTTREKTDRVKSTAYLSPOELEDVFY 
QYD VK £ E 3 YS FG 3 Vlj WE I ATGD 3 P FQG CNS 5 K 3R KL VAVKRQQE 
PLGEDCP£ELREI2DECRAHDPSVRPSVDEILKKbSTFSK*CIK 
3 


6845


3 


151S 


V A V R DEC Y K RH V F W DQD LWKL L F I LMCKP ETARARLE Y R I RTLB 
GALENAQNLGYQGAKFAWESADSGLEVCPEDIYGVQEVHVNGAV 
GLAFELYYHTTQDLOLFREAGGWDWRAVASFWCSRVEWSPREE 

K YHLRGVNS pdeyhsg vnns vytnvlvqnslrfaaalaqdlglp 

ypvpfslspdvrrknleiyeavtspogpamtwsmfavgwmelkd 
avrarglldrs fankae pfkvwtenadgsgavnfltgmgg flqa 
vyfgctgfrvtragvtfdpvclsgisrvsvsgifycgnkl^fsf 
sedsvtvevtaragpwapkleaelwpsosrlsllpgkkvsfprs 

AGRIQMSFPKLPGSSSSEFPGRTFSDVRDPLQSPLWVTLGSSSP 
TESLTVDPASE*SGTGASETSLGPSLWFRLHPPLLGTLLACHPS 
PAARLSGKVHAAWPEFKAFCij 


6846


212 


1256 


LYFLKTIK* Lr^RI^EHP*YENEKLTKLRNTIMEQYTRrLLSARG 
1 1 FTKTROSAYALSQW I TENEK FAE VG VKAHH LIGAGHS S EFKP 
KTQNEQKEV1SKFRTGKINLL1ATTVAEEGLDIKECK1V1RYGL 
VTNE I AMVOARGRARADESTYVLVAHSGSGVI EHETVND FREKM 
KY KAIHCVCKMKPEE YAHKI LELQMQS IKEKKMKTKRN 1 AKHYK - 
NNPSL1TFLCKNCSVLACSGEDIHVIEKKKHVNMTPEFKELYIV 
RENKTLQKKCADYQINGE1 1 CKCGQAWGTMMVHKGLDLPCLKIR 
KFVWFKNNSTKKOYKKWVELFITFPMIJDYSECCLFSDED 


6847 


1450 


346 


SMCWNSDRLEMPL1DLALILYPPSYVPYTGHLSDDSLSRKYCLT 
WFEDALNG VL* RAE AI QPHCVNAGDRMEKFRQKYWKLQT1»RQQ 
PFAYGTLTVRSLLDTREHCLNEFNFPDPYSKVKQRENGVALRCF 
FGWRSLDALGWEERQLALVKGLIAGMVFDWGAKAVSAVLESDP 
Y FG FE SAKR KLQER P WLVDS Y S E WLQR LKG PPH KCAL1 FAPNSG 
ID3 I LGVFPFVRELLLRGTEVILACNSGPALNDVTHSESLI VAE 
R I AGMDPWHSALREERLLLVOTGSS S PCLDL S RLDKGLAALVR 

bRt> ADIjVV J EG MbRA VHTN i HAAJja C_ L b IjKLA V I KNA WliAbX LG 
GRLFSVIFKYEVPAE 


6848


19 


16 


AMWWNSLDG 1 RNI VLSNPKKRKTLSLAMLKSLQSDILKDADSND 
LKVI I ISAEGPVFS SGHDLKELTEEQGRDYHAEVFQTCSXVMMH 
I RN HP VP VI AMVNGLATAAGCQLVAS CD! AVASDKSSFATPGVN 
VGLFCSTPGVALARAVPRKVALEMLFTGEPISAQEALLHGLLNK 

V Vi't/lCJJV^*- •» »™Kx/lKAJ./10JJcXJrV v oJjOJVftl T J IVvJJrVJJ.Dvj.l>i 

YYLTS QAMVDNLALRBGQEG I TAFLQKR KPVWSHEPV* VEH 


6849 


70 


821 


SLGVDGSCLEQGSPAPRPQTDTS P * P VGNWATQOEDLYHOS YEC 
VCVLFASVPDFKEFYSESNINHEGLECLRLLNEIIADFDELLSK 
PKFSGVEK1 KT1 GSTY MAATGLNATS GQDAQQDAERSCSHLGTM 
VE FA VALGS KLDV1 N KHS FNN FRLR VG LNHGP WAG V J G AQ KPQ 
YDIWGNTVKVASRMESTGVLGKIQVTEETAWALQSLGYTCYSRG 
VI KVKGKGQLCTYFLNTDLTRTGPPSATLG 


6850 


2 


1235 


ARGLNHEWTFEKLR0HISRNA0OKQELHLFMLSGVPDAVFDLTD 
LDVJjKLELIFEAKl PAKISQMTNLQELHLCHCPAKVEQTAFSFL 

GLES LRELRKLKI LHVKSN LTKVPSN I TDVAPHLTKLV I HNDGT 
KLLVLNSLKKNMNVAELELONCELERIPKAIFSLSNLOELDLKS 
KN1RT1EE1 3 SFOHLKRLTCLKLWHNKI VTIPPSITHVKNLESL 
YFSNNKLESLPVAVFSLOKLRCLDVSYNNISMIPIEIGLLOWLQ 
HLH I TGNKVD 1 LP KQLF KC I KLRTLN LG QNCI TS LPE KVGQLSQ 
LT0LELKGNCLDRLPA0LGOCRMLKKSGLWEDHLFDTLPLEVK 
EALNQD1NIPFANGI 


6851 


1765 


660 


VSAOVSAREGENCLGVJNLADSSQESYKSLEEAEDCYPPSLLTLD 
LRDLFNQVEOGPLLS CPKAGTDLSMGRAREVGWMAAGLM 1 GAGA 
CYCVYKLTIGRDDSEKLEEEGEEEWDDDQELDEEEPDIWFDFET 



MARPWTEDGDWTEPGAPGGTEDRPSGGGKANRAHP1K0KPFPYE 
HKNTWSAgNCKNGSCVLDLSKCLFlOGKLLFAEPKDAGFPFSQD 
INSHI^SLS^^TSPTPDPTVKEALCAPDNLNASIESOGOIKN 
YINEVCKETVSRCCNS FLQQAGLNLL2 SMTVINNMLAKSASDLK 
FPLISEGSGCAKVOVLKPLP^GLSEKPVLAGELVGAQMLFSFMSL 
F2 RNGNRE 1 LLETPAP 


6852 


1 


407 


RTRGEETYANFI KHNDGKNI FYAARTPATLFAVMFAMy 1 1 SGLT 
GF2GLNS I AVLCNL VMGLALI FLCTWAY VKYSGEFRE 3 GTVIDQ 
IAETLWEQVLKPLGDNLMEENIR0SVTNS3KAGLTD0VSHHARL 
KTD 


6853


3 


469 


GDSCAVCIELYKPNDLVRILTCNHIFHKTCVDPWLLEHRTCPMC 
KCDILKAIX3IEVDVEDGSVSLOVPVSNE3FNSASSHEEDNRSET 
ASSGYASVOGTYEPPLEEHVQSTNESLOLVNHEANSVAVDVIPH 
VDNPTFEEDETPNOETAVREIKS 


6854 


1148 


585 


KES Y 1 GTFD PG ELCVCAA1 OWLQDNS AS Y FLNRKLVY E F STQAK 
PVKNTFLRMW3 YSHHI YQ0DLRKK1 LDVGKRLDVTGFCKTGKPG 
1 1 CVEGFKEHCEEFWHT I RYPNWKHI ? CKKAESVETEGNG2DLR 
\ iPUQ ~PT?T 1 .PIvPnnvm .R^TCYVMNI .GOF1 .ffi.kkhkclfvi vrgi 

X>r no • Pi Pr I f i n 1 r ~ v 3 kj 2 \jXjJ\-^*lj j iii'ji ▼ v ■ r 1 >n- r u /v xvn 1 \ 0 dhv r v 

LFGIESKSSDS 


6855 


1913 


1146 


GRVGGRVGR3 CSPLSGANEYIASTDTLKTEEVLLFTDQ7DDLAK 
EEPTSLFQRDSETKGESGLVLEGDKEIKQ3FEDLDKKLALASRF 
YI PEGCIORWAAEMWALDALHREG I VCRDLNPNNI LLNDRGH I 
nT.TYF^T?W^FWDKCT)^DAIERMYCAPFVfiAITEETEACDWWSL 
GAVL FELLTG KTLVECH PAG I NTHTTLMMPEW VSEEARS LI QQL 
LQFNPLERLGAGVAGVEDI KSHPFFTPVDWAELMR 


6856 


1617 


997


VTOLYVSVDASTKDSLKKIDRPLFKDFWOOFLDSLKAI^.VKQOR 
TVYR LTLVKAWtfVDELQAYAQLVSLGNFDFI EVKGVTYCGES SA 
^T/PM&HVPWKF^VVOFVRFLVDT.I PFYEI ACEHEHSNCLLI AH 
RKFKIGGEWVJTWirCYKRFOELlOEYEDSGGSKTFSAKDYhlARTP 
HWALFGASERGFDPKDTRH0RKNKSKA3 SGC 


6857


1 


617 


"KGPEATAMVCVCSKPNCROKHIKPSHSAAOTMCGSPTPASAPNH 
KLMAMEQGKTL PS ATEDAKEEGLEAQ I £ RLAEL I GRLE£ KALWF 
DLO0RLSDEDGTNMHLQLVR0EMAVCPEQLSEFLDSLR0YLRGT 
TG VRN CFHI TAVRLSDG F? F V I Y EFW ETEEAW KRHLQS F L C KAF 
RHVKVDTLSQF EALSRI LVPAAWCTVGRD 


6858


2 


669 


RSRG 1 KDFENDP PLSSCG I FQSR 2 AGDA LLDSG I RI S S V FAS PA 
IiRCVOTAKLILEELKLEKK IKIRVEPG1 FEWTKWEAGKTTPTLM 
SLEELKEANFNIDTDYRPAFPLSALMPAESYOEYMDRCTASMVQ 
IVNTCPQDTGVILIVSHGSTLDSCTRFLLGLPPRECGDFAQLVR 
KIPSLGMCFCEENKEEGKWELVNPPVKTLTHGANAAFNWRNWIS 
GN 


6859


1 


1150 


GETMFKKAKTKAKKKPRKRSDSSGGYKLSDIIQSPSSTGLLKSG 
KTNSVESLPELLTSDSEGSYAGVGSPRPLQSPDFTTGFHSDKIE 
AKVKP Y VNGTS PVYSREDLKPWEKS P 2 LK 1 5APQPIP SNR 1 DTT 
SSAS WVAGS FSP VS PPWBLRTI ME I E ESRQKCGATPKS H LG KT 
VSHGVKLSQKQR KM IALTTKENNSGMNS METVLFTPSKAP KP VN 
AWASSLHSVSSKSFRDFLLEEKKSVTSHSSGDHVKKVSFKGIEN 
SQAPKlVRCSTHGTPGPEGNHISDIiPLLDSPNPWLSSSVTAPSM 
VAPVTFAS I VEEELQQEAALI RSREKPLALIQIEEHAIGDLLVF 
YEAFGN P EEF V 3 VERTPOG F LAVPMWK KHGC 


6860 


1889 


1515 


DKDKKRQKKRGI FPKVATN 2 MRAWLFQHLTHPYPSEEQKKQLAQ 
DTGLTI LQVNNW F INARRI I VQPMI DQSNRAVSQGAAYS F EGQP 
MGS FVLDGQQKMG I RPAG PMSGMGMNMG MDGQWHYM 


6861 


1885 


1515 


DKDKKRQKKRGI FPKVATN 2MRAWLFQHLTHPYPSEEQKKQLAQ 
DTGLTI LQVNNWFI NARR3 2 VQPMI DQSKRAVSOGAAYS F EGQP 



MGSFVLDGQQHMGIRPAGPHSGMGMNMGMDGQWHYM 


6862


2 




EEIDREFHNKLKLKEDKLEKQEKPVNGEDKGDSGVDTQNSEGNA 
DEED PLG PN CY YD KT K S F FDK I SCDDNRERR PTWAEE R R LNAET 
FGIPLRPNRGRGGYRGRGGLGFRGGRGRGGGRGGTFTAPRGFRG 
G FRGGRGGRE FAD FEYRXTTA FG P 


6863 


2216 




PQE PALKSE FSQVASNT I PLP LPQPNTCKDNGPCKQVCSTVGGS 
A1CSCFPGYAIMADGVSCEDCDECLMGAKDCSRR0FCWTLGSF 
Y CVNHTVLCADGY I LNAHR KC VDI WECVTDLHTCSRGEHC VNTL 
GS FHCY KALTCEPG YALKDGE CEDVDECAMGTHTOQPG FLCQKT 
KGSFYCQARQRCMEX3FLQDPEGNCVDINECTS1.SEPCRPGFSCI 
NT VG S YTCORNPLI C ARG Y K AS DDG T KCVDVN ECETG VKR CGEG 
QVCHNLPGSYRCDCKAGFORDAFGRGCIDVNECWASPGRliCOHT 
CENTLGSYRCSCASGFLLAADGKRCEDVNECEAQRCSQECANIY 
GSYQCYCROGYQLAEDGHTCTDIDECAQGAGILCTFRCLNTVPGS 
YQCACPEQGYTMTANGR S C KDVDEC ALGTHN C S EAETCHN IQGS 
FRCLRFECPPNYVQVSKTKCERTTCHDFLECONS PAR I THYOLN 
FQTGLLVPAHI FRIGPAPAFTGDT1 ALNI I KGNEEGY FGTRRLN 
A YTGWY LQRA VLEP R D F ALD V EMK bWRQGS VTTFLAKMK I FFT 
TFAL 


6864 


2 




LADSSPSNLQIIIKELLSMHHOPDPALTKEFDYLPPVDSRSSSG 
FVG LRNGGATCYMNAVFOOLY MOPGLPSGLLS VDDDTDN PDDSV 
FYQVQSLFGHLMES KLQYYVP ENFWKI FKMWNKELYVREQQDAY 
EFFTSLI DQMDEYLKKMGRDQI FKNTFQGIY SDOKICKDC PHRY 
EREEAFMALNLGVTSCCS LE I SLDQ FVRGEVLEG SNAY Y CEK CK 
EKRITVKRTCIKSLPSVLVIHljMRFGFDWESGRSIKYDEOIRFP 
WKLNME P YTVSGMARQDSS S E VGENGRS VDQGGGG S P RKK VALT ! 
ENYEbVGVIVHSGQAHAGHYYSFIKDRRGCGKGKWYKFNDTVIE • 
E FDLNDE TL E YECFGGE Y R P KVYDQTN P YTDVRRR YWNA Y MLFY 
QRVSDQNSPVLPKKSRVSWROEAEDLSLSAPSSPEISPCSSPR 
PHR PNNDRLS I LTKLVKKG EK KGLFVEKMPARI YQMVRDENLKF 
MKNRD VYS SDY FS FVhS LAS LNATKLKHP Y x PCMAKVS LQLAI Q 
FL FQT YLRTKKKLR VDTE £ W I ATI E ALLSKS FDA COWL VE Y F I S 
SEGRELIKIFLLECNVREVRVAVAT3LEKTLDSALFYODKLKSL 
HQLLEVLLALLDKDVPENCKN CAQYFFLFNTFVQKQG I RAGDLL 
LRHSAIiRHMISFLLGASRONNOIRRWSSAOAREFGNLHNTVALL 
VLHSDVSSQRNVAPG I FKQRP ? ISIAP S SPLLPLHEE VEALLFM 
SEGKPYLLEVMFALRELTGSLLALIEMWYCCFCNEHFSFTMLH 
FIKNQLETAPPHEL1OJTF0LLHEILV1EDP1QVERVKFVFETEN 
GLLALMHHSNHVDSSRCYOCVKFLVTIiAQKCPAAKEYFKENSHH j 
WSWAVQWLOKKMSEKYWTLOSNVSNETSTGKTFQRTISAODTLA 
YATALLNEKEQSGSSNGSESSPANENGDRHLQQGSESPMMIGEL 
RSDLDDVDP 


6865 


1820 


1242 


DPERWKHLSKVTPPGSSVSTTPVQWRLQSPQSQGSMMPSCNRS 
CSCSRGPSVEDGKWYGVRSYLKLFYEGYAVPPKLEGIGEGEFLV 
LDORAADYNQALGTCRLAGTALCVAAGVLLAI CLFWAM IGWLS Q 
DTKAEPLDPEAD SH VE VFG DE P EQQLS P I FR NASGQS W FS F P AS 
PFGQSSVQT1QPKRDS 


6866 


1571 


4£; 


DCPRPRYTLYGLRATCMRDLDWAWINAVSAFKALEQDLPVN3KF 
I IEGMEEAGSVALEELVEKEKDRFFSGVDYI VI SDNLWI SQRKP 
AITYGTRGNSYFMVEVKCRDODFHSGTFGGILHEPMADLVAIiLG 
SLVDSSGHIIiVPGIYDEWPLTEEEINTYKAIHLDLEEYRNSSR 
VE K FbFDTKE E I LMH LWR Y PS L S I HG I EGAFDEPGTKTV I PGR V 
IGKFSIRliVPHMNVSAVEKQVTRHLEDVFSKRNSSNKMWSMTL 
G LH P Wl AN I DDTQ YLAAKRA2 RTVFGTEPDMI RDGST1PI AKMF 
QEI VHKS WLI PLGAVDDGEHSQNEKINRWN Y I EGTKLFAAFFl* 
EMAQLH 



6867 


2833 


1704 


GTR1MSQPKQKELAGFVRQKMLLDYSVYMGRCVPQESRSP0RSP 
LOS AES S PTAG KKLP EVP PS EEEEQEAWVNALLGRI FWD FLGEK 
YWSDL V$ K K 1 QMKI »S K J K LP Y FMNELTLTELDMGVAVPK I LQAF 
KPYVDHOGLW I DLEMS YNGS FLKTLETKMNLTKLGKE PLVEALK 
VGE1GKEGCRPRAFCLADSDEESSSAGSSEEDDAPEPSGGDKQL 
LPGAEGYVGGHRTSK3MRFVDKITKSKYFQKATETEFIKKKIEE \ 
VSNTPLLLTVEVQECRGTLAVNIPPPPTDRVWYGFRKPPHVEIjK 
ARPKLGEREVTLVKVTDWIEKKbEOEFOKVFVMPNMDDVYITIK 
HSAMDPRSTSCLLKDPPVEAADQP 


6868


1 


34( 


R PTRPPTR PEE I KNLI LPY2 SDMNFVQDLCEDFYELFKTDKGFD 
KATFESOMS VMRGOI LNLTOALRDGKSPFQLVQI PCV3 VERSQG 
GSQGU I VHLSNS FTQTVNCRKP FFSS W 


6869 


3 


1619 


MYMERMDKRALISFWESVEHLKNANKNEIPOIjVGEIYQNFFVES 
KE I S VEKSLY KE I QOCLVGN KG I EVFY KIQEDVYETLKDRYY PS 
FIVSDLYEKLLIKEEEKHASOMISNKDEKGPRDEAGEEAVDDGT 
NQINEQASFAVNKLRELNEKLEYKRQALNSIQNAPKPDKK3VSK 
LKDEI I LI EKERTDLOLHMARTDWWCENLGMWKASITSGEVTEE 
NGFOTlPCYFV^fV^T.^FVGGV^TlC^JWTVPl(RT.^FFHW? c?PP 
VPSLKKDOLPSLSKLPFKSIDHTFMEKFENQLNKFLQNLLSDER 
LCOSEALYAFLSPSPDYLKVIDVQGKKNSFSLSSFLERLPRDFF 
SHQEEETESDSDLSDYGDDVDGRKDALAEPCFMLIGEI FELRGM . 
FKWVRRTLI ALVQVTFGRT I N KOI RDTVSWI FS EOMLVY Y 3 N I F 
RDAFWPNGKLAPPTTIRSKEOSQETKQRAOOKLLENIPDMLOSL 
VGQQNARHGI 3KI FNALQETRANKHLLYALMELLLIELCPELRV 
HUDQLKAGQV 


6870 


1 


156£ 


MAAWAATRWWOLLLVLSAAGMGASGAPQPPNILLLLMDDMGWG 
DLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLCSPSRAALLT 
GRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAG 
YVSKIVGKWHLGHRPOFHPLKHGFDEWFGSPNCHFGPYDNKARP 

ARHHPFFLYWAVDATHAPVYASKPFLGTSQRGR YGDAVRE I DDS 
3 GKI LELLQDLHVADNTFVFFTSDNGAALI S APEQGGSNG PFLC 
GKQTTFEGGMREPALiAWWPGHVTAGQVSHOLGSIMDLFTTSLAL 
AGLTPPSDRAI DGLNLLPTLLOGRLMDRP3 FY YRGDTLKAATLG 
QHKAHFWTWTNSWENFROGIDFCPGQNVSGVTTHNLEDHTKLPL 
IFHLGRDPGERFPLS FASAE YOEALS R 3 TS WQQHQEALVP AQ p 
QLNVCNWAVMNWAPPGCEKIX3 KCLTPPES I PKKCLWSH 


6871 


209 


112t 


RMSLNPP1FLKRSEENSSKFVETKQS0TTSIASEDPL0NLCLAS 
QEVLQKAOQSGRS KCLKCGGSRMFYCYTCYVPVENVP I EQI PLV 
KLPLK 3 D 3 1 KH PNE TDGKSTA I HAKLLAPEFVNI YTYPCI PE YE 
EKDHEVAL3 FPGPOS 1 SIKDI S FHLQKR IQNNVRGKNDDPDKPS 
FKRKRTEEQEFCDLNDSKCKGTTLKKI X FIDSTWNQTNK3 FTDE 
RLQGLLOVELKTRKTCFWRHQKGKPDTFLSTIEAIYYFLVDYMT 
DI LKEKYRGQYDNLLFFYS FMYQLI KNAKCSGDKETGKLTK 


6872 


880 


459 


FGLLMWLSLI FMKGNCVREDLI FNFLFKLGLDVRETNGLFGNT 
KKLITEVFVROKYLEYRRIPYTEPAEYEFLWGPRAFLETSKMLV 
LRFLAKLH KKDPQS W P FHYLEALAECEWEDTDEDEPBTGDSAHG 
PTSRPPPR 


6873 


1929 


955 


DEQAVLCSKDKTYDLKIADTSNMLLF3PGCKTPDQLKKEDSHCN 
3 IHTEI FGFSNNYWELRRRRPKLKKLKKLLMENPYEGPDSQKEK 
DSNSSKYTTEDLLDQIOASEEEIMTQLQVLNACKIGGYWRILEF 
D YEMKLLNHVTQL VDS ES WS FG K V PLNTCLQELG PLE PEEM I EH 
CLKCYGKKYVDEGEVYFELDADKI CPAAARMLLQNAVKFNLAEF 
QE VWQQS V PEGMVTS LDQLKGLALVDRHS RPE 1 1 FLLKVDD LPE 
DNQERFNSLFSLREKWTEEDIAPYIQDLCGEKQTIGALLTKYSH 
S SMQNG V KVYNSRR PIS 


6874 


1 


3 07 


D5 3 ADH VNS AA VNV E E G 7KNLG KAAK YKLAALP VAGAL I GGMVG 
GF3GLLAGFKVAGIAAALGGGVLGFTGGKLIQRKKCKMMEKLTS 
SCPDLPSQTDKKCS 


6875 


i68e 


34S 


VI G7G2RGNSASEKWE } MFNEELGDPFI I1HS I S LLN AEEHS I A 
TLLLRIEKEELDMKGSGFYVSLEVTVTISKKNODNKKYEIIKRDI 
LRGKSVFHYAA1EPDGKGLMIVSYKSLTFVQAG0DL-EENMDEDI 
SE K 7 KEPLYYW00TEDDLTVTIRLPEDNTKED1 01 QFLPDH1N3 
VLKDKQFLEGKLYSSI DHESSTWI IKESNSLE1 SL1 KKNEGLTW 
PELVJGDKQGELIRDSAOCAAIAERLMHLTSEELNPNPDKEKPP 
CKAOELEECDI FFEESS SI.CRFDGNTLKTTHWNLGSNQYLFSV 
I Vi: P KEMPCFCLRHDVDALLWQPHSS KQDDMWE H I AT FNALGYV 
QAS KRDKKFFACAPNY E YAALCECLRRVFI YRQ PAPKSTVLYNR 
KEG RQVGQ VAK00VAS LETNDP 1 LGFQATNERLFVLTTKNLFLI 
KWTEN 


6876 


41 


1285 


VGEMTLlWRHLbRPLCLVTSAPRILEMHPFLSLGl'SRTSVTKLS 
LHT K PR MP PCDFMPER YQV I FLVNSGSEANELAI^LMARAH SNNI 
Dl 1 SFRGAYHGCSPYTLGLTNVGIYKMELPGGTGCQFTMCPDVF 
RGPV?GGSHCRDSPVQT3 RKCSCAPDCCQAKDQYI EOFKDTLSTS 
VAK S I AGF FAE P I QG VNG VVQY PKG FLKEAFEL V RAR GG V C I AN 
EVCTGFC3RLGSHFWGF0THDVLPDI VTMAKG3 GNG FFMAAVI TT 
PEjAKSLAKCLOHFNTFGGWPMACAIGSAVLEVIKEENLOENSQ 
EVGTYML.LKFAKLRDEFEIVGDVRGKGLMIGIEMVQDKISCRPL 
PREE\ , NOIHEDCKHMGLLVGRGSIFSQTFRIAPSKCITKPEVDF 
AVEV FRSALTQHMERRAK 


6877 


3 




GTSPSPARAYAPPTERKRFY0NVS1TQGEGGFEINLDHRKLKTP 
0AKLFTVPSEALAIAVA7EWDSQQDTIKYYTMHLTTL.CNTSLDN 
PTQI^ KDQLI RAAVKFLDTDTI CYHVEEPETLVELORKEWDPI I 
EWAFKRYGVE 1SSSTS1 MG ?S I PAKTREVLVSHLAS YKTWALQG 
IEFVAAQLKSMVLTLGL3 DLRLTVEQAVLLSRLEEEYC'I OKWGN 
I EW AMD YELQELRARTAAGTLF IHLCS ESTTVKH KLLKE 


6878 


93j 


2kz 


QTLOGDFKN'RAEMI DFNJ RI KNVTRSDAGKYRCEVSAFSEQGQN 
LEE DT VTLE VLVAPAVP S CEVP S £ ALS GTWELR CODK EGN PAP 
E YTV; F KDG I RLLEN PR LGS QSTNS S YTMNTKTGTLQ FNT VS KLD 
TGEY S CE ARNS VG YRR CPG KRMQVDDLN I SG 1 1 AA WWAL V I S 
VCGLGVCYAQRKGYFSKETSFQKSNSSSKATTMSENDFKHTXSF 
11 


6879 




84 5 


IRV^GESDJMQEFLSESDENYNGVSDVELRVALPDGTTVTVRVK 
KNS TTDOVYQAI AAKVGKDSTTVNYFALFEVI SHSFVR KLAPKE 
FPK KLY 1 ON YTS AVPGTCLTI RKWLFTT E EEI LLN DNT LA VTYF 
FH OA V DDV KKG Y I KAE E K £ Y QLQKL YEQRKMVM Y LN MLRT CEG Y 
NE3 j FPIICACDSRRKGHVITAISITHFKLHACTEEGOLENQVIA 
FEWDEMQRWDTDEEGMAFCFEYARGEKKPRWVKI FTPYFNYMHE 
CFERVFCELKWRKEEY 


6880 


2110 


1437 


RKDNCTAKEWTFPEAKWNTTARVFSHIRLGMGHVLI I VQCFI SS 
MAN3 YNEKI LKEGNQLTES I FIQNSKLYFFGILFNGLTLGLQRS 
NRDC^ KN CG F FYGHRAFS VAL I FVTAFQGLSVA F I LKFLDNM7H 
VLN'AQ VTTVI I TTVS VLVFDFRPSLEFFLEAPS VLLS 1 FIYNAS 
KP0VPEYAPR0ERIRDLSGNLWER5SGDGEELERLTKPKSDESD 
EDT7 


6881 


263£ 


2244 


NDSKWEDJ HVI'J'GALKNFFRELPEPJjFTFNHFNDFVNAI KQEPR 
0RVAAVKDLIROLPKPNODTMQILFRHLRRVIENGEKNRMTYQS 
I A I VFG PTLLKF E KETGKI AVHTVYQNQ I VELI LLELS £ I FGR 


6882 




850 


GI PF AOLWI YPV KS CKGVP V S EAECTAMGLRSGKLRDRFWLV I N 
QEGJvMVTARQEFRLVLI SLTCDGDTLTLSAAYTKDLLLPI ktpt 
TNAVHKCR VHGLE I EGRDCGEATAQWITSFLKSCPYRLVHFEPH 
mPKRPHQIADLFRPKDOlAYSDTSPFLlLSEASLADLNSRLEK 








KV KATNFRPH I VI SGCDV YAEDSWDELL3 GDVELKRVMACSRC3 
L; TVDPDTGWSRKEPLETLKSYROCDPSErvKljyG 
VLENPGTJ KVGDPVYLLGC 


6883 


2754 


2256 


K S KbKLNOK LKLF I TLTYQVLS LHGWGPG I HLQKEGAFP VTQKR 
A LQLLYDLK VLN 7 I VLTA KG DEVKSGRS K PDSR I EKVTDHL EAL I 
DPFDLDVFTPHLNSNLHRLVQRTSVLFGLVTGTEN01APRSSTF 
NS0EPHN3LPLASSQIRFGLLPLSMTSTRKAKSTRNIETKAQYD 
ANC 


6884 






EFERVTAEAVKPRETSEPRAAAQRFCEKFPFL 


6885 


2S7 


1554 


STGOFWHVTDLHLDPTYHZTDDHTKVCAESKGANASNPGPFGDV 
LCDSPYQU LSAFDFI KNSGQEAS FM3 WTGDS PPHVPVPELSTD 
TVilTVITNMTTTIQSLFPNLOVFPALGNHDYWPQDQLSWTSKV 
yNAVANLWKFWLDBEAl STLRKGGFYS0KVTTNPNLR1 ISLNTN 
LYYGPNIKTLNKTDPANOFEWLESTLNNSOONKEKVYIIAHVPV 
GYLPS SQN 3 TAMREY YNE KLI DI FOKY SDV I AGQF YGHTHR DS 1 
KVLSDXKGSPVNSLFVAPAVTPVKSVLEKOTNNPGIRLFQYDPR 
DY KLLDMLCY Y LNLTEANLKGES IW KLEY ILTQTYDI EDLQPES 
LYGLAXQFT1 ldskqfi ky ynyffvs YDSSVTCDKTCKAFQI ca 
IMNLDNISYADCLKQLY1KHNY 


6886 


? 


1341 


OCGGIPGREGGSSRPLEEGTGSSPACVRGA^PGSEDAFYPTRAK 
CAK VGQELKKAAXRTVS IS EGPDTLGDGf4RERRETLALAPEPEP 
LEKEACEKWKRPFRSASATSLTLSHCVDWKGLLDFKKRRGHS1 
GGAPEORYC: 1 PVCVAARL PTRAQDVLDAHLSEVNAVR FGPNS S 
hU\ TGGADKLIHLWNWGS 5 LEANQTLEGAGGS I TS VDFDPSGY 
OV1AATYN0AAQLWKVGEA0SKETLSGHKDKVTAAKFKLTRHQA 
VlGSRDRTVKEWDLGRAyCSRTINVLSYCNDWCGDHIIISGHN 
DQK2 RFWDSRG PHCTQVI PVQGRVTSliSLSKDQLHLLSCSRDNT 
LKV I DLRVSN 1 RQVFRADGF KCGSDWTKAVFS PDRS YALAG S CD 
GALYIWDVDTGKLESRLOGPHCAAVNAVAWCYSGSHMVSVDQGR 
KWLWQ 


6887 


1047 


116 


KTARPSQKFFWEAGAVPGDPLSTGCSQAOLGGCCPRGPWGPQHG 
GQCRAAGPTLPRGERGGPOQSGPGLAAOTPPTSKQVAWRAFLTG 
TYRSOSPRSFAGPFRGGTGWWPEPAVCLCVAVGPQRLSSPGLVy 
NASGS EHCYDI YRL YHSCADPTGOGTG PDA^AWD YOACTE INLT 
FASNNVTDKFPDLPFTDELRORYCLBTWGWFRPDWIiLTSFWGG 
DL- RAAS Nil F SNGNLDP W AGGGI RRNL S AS V I AVT I QGGAKHLD 
LKASHPEDPAS WEARKLEATI 3 GEWVKAAR REQQPALRGGPRL 
Si- 


6888 


J 


992 


FVAYVKKEI PHI WTHCLLKPHALVI KTLPTKLRDALFTVVRVI 
JJF1 KGRA PI\TK RLFQAFFEE j G 2 E YS VLL FHTEMRWL SRGQ I LTH 
iFEKYEEIROFTjKHKSSNT-^GFENKEFKIHLAYLADLFKHLNE 
LSASMQRTGMI^TVSAREKLSAFVRKFPFWCKRIEKRNFTNFPFL 
EE 3 1 VSDNEG I FI AAEITLKLQQLSNFFHGYFS IGDLNEASKW I 
LDPFLFNJ DFVDDS YLMKNDliASLRASGOI L-MEFETMKLEDFWC 
AQFTAFPNLAKTALE I LMP FATT YLCELGFS I TFTFQNKVPEAA 
LI LSDDIRVAISKKVPSFLGHH 


6889 


J 


1534 


LTLSNQIKEEREQDNSESPNGRTSPLVSONMEQGSTLRDLLTTT 
AGKJLRVGSTDAGIAFAPVYSMGAPSSKSGRTMPNILDDIIASW 

I CEKHI LWLKDYKNSSWWKLFKECI^KOGQFAVVSGVHKFCMNTSL 
WKAESI SLDFGDHQADLLNCKDS I ISNAKVKE FWDGFEEVS KRQ 
KKKSGETV\'LKLKDWPSGEDFKTMMPARYEDLLKSLPLPEYCNP 
EGK FNLASKL PG FFVRPDLGPRLCSA YGWAAKDHDI GTTNLHI 
EV S D WN I LVYVG I AKGN G 2 LS KAG I LK K FE E EDIjDD I LRXRLK 
DSSEI PGALKKI YAGKDVDK I REFLQK I S KEQGLEVLPEHDPI R 
DOS W YVNKKLRORLiIiEEYGVRTWTIjI QFLGDA I VLPAGALHQVO 








NFHSClQVTEDFVSPEHLVESFHLTOELRLbKEElNyDDKLQVK 
NILYHAVKEMVRAfcKIHEDEVDDMEEN 


6890 


3 


667 


TKACGKWIPLYLKRALWKKTAETCNSPPCGAKDSLIFGA1TCF 
TG FLGVDTGAGATR WCRLKTORADPLVCAVGMLGS A I FI CLI FV 
AAKSSI VGAYI CI F VGETLLFSNW AI TAD 1 LM YWI PTRRATAV 
ALOSFTSHLLGDAGSPYLIGFISDLIRQSTKDSPLWEFLSLGYA 
LMLCPFWVLGGMFFLATALFFVSDRARAEQQVKQLANPPASVK 
V 


6891 


1980 


1262 


LK I HQEbLS KELKLLRG I TI E S I I H I GLAAG keqfmqdasnvmq 
LLLKTQSHLraMEDNNPEVROAAAYGLGVHAQFGGDDYRSLCSE 
AVPLLVKV1 KRAHSKTKKNV1 ATENC1 S A 3 GK I LKFKPNCVNVU 
EVLPHKLSWLPLHEDKEEAI OTLS FLCDLI ESNHPW1GPNNSN 
LPK1 1 SI 1AEGKINETINYEDPCAKRLANWRQVQTSEDLWLEC 
VSOLDDEQQEALQELLNFA 


6892 


3 


876 


RSVAAASGPGAWGTDHYCLEbJjRKRDYEGYLCSLLLPAESRSSV 
FALRAFNVELAOVKDSVSEKTIGLMRMQFWKKTVED1YCDNPPK 
OPVA1 ELWKAVKRHNLTKRWLMKI VDEREKNLDDKAYRNI KELE 
NYAENTQSSbLYLTLEILGIKDLHADHAASHIGKAOGlVTCLRA 
TPYHGSRRKVFLPMD2CMLHGVS0EDFLRRNQDXNVRDVIYDIA 
SOAHLHLKHARSFHKTVPVKAFPAFLOTVSLEDFLKKIQRVDFD 


6893 


1 


842 


DGERKSMSVERTFSE1KKAEE0YSLCOELCSELAQDLQKKRLKG 
RTVT1 KbKNVNFEVKTRASTVSSWSTAEEl FAl AKELLXTEID 

ATECTLE KTDK DKFVKPLEKSHKKS F FDKKRS ERKWSHQDTFKC 
EAVNK0SFOTSQPFQVLKKK1WENLEISENSDDCQILTCPVCFR 
ACCClSl>EALNK}TraECLDGPSlSENFKMFSCSHVSATKVNKKE 
NrVPASSLCBKQDYEAH 


6894 


1742 


1463 


TTLCKPbVPREHQFYETLPAEMRXFTPQYKGKSQLLEGLPHWRG 
DVRDRGHGRPWQPSLEPSLFPTLCFPSLSSFSSSWPSAQHLTPS 
VFNPW 


6895 


2379 


478 


VTYVELCDLASPTALLIMRTVLDLIVEDLQSTSEDKEQQYTSQT 
TRLLALLYALASHKACKXAI LHLINGTI KGDERYAEI FQDlibAL 
VRSPGDSVIRQQCVEYVTSI LQSLCDQDI ALILPSSSEGS ISEL 
EQbSNSLPNKELKTS I CDCLLATLAN S ES S YNCLLTCVRTMMFb 
AEHDYGLFHLKSSLRKNSSALHSLLKRWSTFSKDTGELASSFL 
EFMRQ2 LNSDTI GCCGDDUGbMEVEGAHTSRTMS INAAELKQLL 
OSKEESPENLFLELEXLVLEHSKDDDNLDSLLDSWGLKOMLES 
SGDPLPLSDODVEPVLSAPESLQNLFNNRTAYVLADVKDDQLKS 
MWFTPF0AEEIDTDLDLVKVDUELSEKCCSDFDLHS2LERSF1* 
SEPSSPGRTKTTKGFKLGKHKHETF1TSSGKSEY1EPAKRAHVV 
PPPRGRGRGGFG0GXRPHD1FR0RK0NTSRPPSMHVDDFVAAES 
KEWPQD6I PPPKRPXiKVSOKI SSRGGFSGNRGGRGAFHSQNRF 
FTPPASKGNYSRREGTRGSSWSAQNTPRGNYNESRGGQSNFNRG 
PLPPLRPLSSTGYRPSPRDRASRGRGGLGPSWASANSGSGGSRG 
KFVSGGSGRGRHVRSFTR 


6896 


1 


555 


GNIVIOKKKYNKQK1IPLENVTXDSIKDEGDLRNGWLIKTPTXS 
FA VYAATATEKSEWMNH INK CVTDLLS KSGKTPSNEHAAVWVPD 
SEATVCMRCQKAKFTPVNRRHHCRKCGFVVCGPCSEKRFLLPSQ 
S S KPVR I CDFCYDLLS AGDMATCOP ARSDS YSCSLKS PLNDMS D 
DDDDDDSSD 


6897 


3 


920 


GDGbMHEVVNGbMERPDWETAIOKPLCSLPAGSGNAIiAASLNKY 
AG Y EQVTN 1 EDLLTN CTLLLCRRLLS PMNLLS LHTASGLRLFSVL 
SLA WG FI AD VDLES E KYRR LG EMR FTLGTFLRIiAALRT YRGRLA 
VLPVGRVGSKTPASPWVQOGPVDAHliVPLEEpVPSHV?TWPDE 
DF VL VLA bLKSHbG S EMFAAPMGR CAAG VMHLF YVRAG VSRAML 














LRLFLAMEKGRHMEYECPYLVVVPVVAFKLEPKDGKGVFAVDGE 
LMVSEAV0G0VHPNYFWMVSGCVEPPPSWKP0QMPPPEEPL 


6898 


919 


34 e 


OKTVTAVAS LLKGRCG I YTEN ERRMG A V 3 K I R FFK I MLVL 1 1 CW 
LSN I INES LLFYLEMQTDI KGGSLKP VR TAAKTTWF I M6I LNPA 
QG FLLS LA FYG WTGCSLG FQ £ F R KE I QWE S LTTS AAEG AH F£ F L 
MPHENPASGKVSQVGGQTSDEALSMLSEGSDASTIE1HTASESC 
NKNEGDFALPTHGDL 


6899 


120 


827 


MKVRKNNDAYLLDKNKINKDCFISCFFKKMLTTLMFSKSGILSL 
LEHGEEYTFSLPC^YARSILTVPWVELGGKVSVNCAKTGYSASI 
TFHTKPFYGGKLHRVTAEVKHN1TNTWCRVQGEWNSVLEFTYS 
NG ET KYVDLTKLA VT K KR VR P LE KQD P FE S RR L W KNVTDS LRES 
EIDKATEHKHTLSERQRTEERKRTETGTPKKTKYFIKEGDGWVY 
HKPLWKI I PTTQPAE 


6900 


3 


453 


TEVLGSKG1 KELRSSTSALHHALEESASLLTMFWRAAJjPSTHI P 
VLPGKVGESTERELLELRTKVSO0EQLL0STTEHLKNANQQKES 
yiEOFIVSOLTRTHDVLKKARTIvLEVRKLLHOSEAPSLSPTHHHP 
LADLVGDSWPALRFOEK 


6901 


1 


20: 


DDNMVQR LE TD FKMTLQQQS TL EQ WAA W LDNVMMQAL KP YEGR P 
SFPKAARQFLLKWSFYRYHLGFE 


6902 


2 


267 


G AP P PP PS Q P PRQP PQAAP S £ H PH S DLT FN F S S ALEGQAGAQG A 
SDMPEPSLDLLPELTNPDELLSYLDPPDLPSNSNDDLLSLFENN 


6903 




149 


RIN0VYR03PTGIHILVIDQMVQNFQDESCFLFSTVKAESSDGI 
HULK 


6904 


464 


2092 


MEASLPVSLSCVLACGDVEGKFDILFNRV0A1QKKSGNFDLLLC 
VGNFFGST0DAEWEEYKTG1KKAPIQTYVLGANNQETVKYFQDA 
DGCELAENITYLGRKGIFTGSSGLQIVYLSGTESLNEPVPGYSF 
S PKDVS SLRMMLCTTSQFKGVD I LLTSPWPKCVGNFGNSSGEVD 
TKKCGSALVSSLATGLKPRYHFAALEKTYYERLPYRNHIILQEM 
AQKATRFI ALANVGNPEKKK Y LYAFS I V PMKLMDAAELVKQPPD 
VTENP YRKSGQEAS IGKQ1 LAP VEESACQFFFDLNEKQGRKRSS 
TGRDSKSS PHPKOPRKPPOPPG PCWFCLAS PE VEKHLWN J GTH 
CYLALAKGGLSDDKVLILP 2 GH YQSWELSAEWEEVEKYKATL 
RR FFKSRG K WC VVF ERN YKSK H LQLQV I PVP1SCSTTDDIKDAF " 
I TOAQEQQI ELLEI PEHSD1 KG I AQPGAAYFYVELDTGEKLFHR 
I K KNFPLQ FGRE VLAS EAI Lirv ? DKS D WR Q CQ I S KED E ETLAR R 
FRKDFEPYDFTLDD 


6905 


1 


226 


VSKTGEAET1 TSHYLFALGV Y KTLYLFNW I WRYHFEGFFDLI AI 
VAGLVQTVLYCDFF YLYI TKVL KG KKL S LPA 


6906 


3 


611 


S YDDHNGH I DFI TAASNLRAKMYS I EPADRFKTKR I AGKI I PAI 
ATTTATVS G L VALEM I KVTGG V P FEAYKNWFLNLA1 P I WFTET 
TEVRKTKIRNGISFTIWDRWTVHGKEDFTLLDFINAVKEKYGIE 
PTMW0GVKMLYVPVT4PGHAKKbKLTMKKLVKPTTEKKYVDLTV 
S FAPDIDGDEDLPGPPVRYY F£ HDTD 


6907 


2 


2228 


LRGVPVWAAGAFRFSSGEESTSHLIMSRRSORLTRYSQGDDDGS 
S S SGGSS VAGSQS TLFKDS PLR TLKRKS S NMKRLS PA PQLGPS S 
DAHTSYYS E SLVKES WFPPRS S LEELHG DANWGEDLR VRRRRGT 
GG S E S S RAS GLVGRKATEDF LG S S SG Y SS E DDYVGYS DVDQOS S 
SSRLRSAVS RAGSLLWMVATS FGRLFRLLYW WAGTTWYRLTTAA 
SLLDVFVLTRRFSSLKTFLWFLLPLLLLTCLTYGAWYFYPYGLQ 
T FK PALVS W W AA KDS RRADEG W E ARDS S F K FQ AEQRVMSR VHS L 
ERRLEALAAEFSSNW0KEAMRLERLELR0GAPG0GGGGGLSHED 
TLALLEGLVSRREAALKEDFRFETAARIOEELSALRAEHQQDSE 
DLFKKIVRASOESEAR I QQLKS E WQSMTOES FQESSVKELRRLE 
D0LAGLQ0ELAALALKQS S VAEE VGLLPQ0 I QAVRDDVESQFPA 
W1SQFLARGGGGRVGLLQREEK0AQLRELESKILTHVAEM0GKS 








AREAAASLHLTLQKEGVIGVTEEOVHHIVKQALQRYSEDRIGLA 
DYALESGGASVlSTRCSETYETKTALLSLFGlPLWyKSOSPRVI 
LOPDVHPGNCWAFQGPOGFAWRLSARIRPTAVTLEHVPKALSP 
KSTI SSAPKDFAI FGFDEDLQOEGTLLGKFTYDQDGEFIOTFHF 
OAPTMATYCWELRILTNVJGHPEYTCIYRFRVHGEFAH 


6908 


3 


780 


QV PS AAW LKAV cglg s r lg lg s r lg logcfgaar lly f r fq s rg 

P0GVEDGDRPQPSSKTPRIPK1YTKTGDKGFSSTFTGERRPKDD 
QVKEAVGTTDELSSAIGFALELVTEXGHTTAEELQK1QCTLQDV 

gsalatpcssareahlkyttfkagpileleqwidkytsolpplt 
afi lpsggk i s s alhfcravcrraerrwplvqmgetdanvakf 

LNRLSDYLFTLARYAAMKEGNQEKIYKKNDPSAESEGL 


6909 


3 


4 OS 


gkliavgti^ygqrssapeqellvqdatpvsnsllpekafsdif 

SPYLRGT3KMMQAVRQAFQDQDDRRTWDGRPLTMAATFDDCLYA 
LC\^VDTIKRSSQTGEWONIAIMTEEPELSPAYLISEAMRRSRMS 
LYC 


6910 


1 


1068 


LVPVWIUSYYYGKLVIAPLNIVLYNIFTPHGPDLYGTEPWYFY 

LAPMYIWF2 I FFIOPHKEERFLFPVYPLICIiCGAVALSALQHSF 
LYFOKCYTIFVFORYRT.FHYTVTSNWLALGTVFLFGI.T.CF^R^VA 
LFRGYHGFLDLYPEFYRIATDPT1HTVPEGRPVNVCVGKEWYRF 
PSSFLLPEKW0LOFIPSEFRGQLFKPFAEGPLATRIVPTDMNDQ 
NLEEPSRY3 D1SKCHYLVDLDTMRETPREPKYSSNKEEVJISLAY 
RPFLDASRESKLLRAFYVPFLSDOYTVYVNYTILKPRKAKQIRK 
KSGG 


6911 


1184 


96C 


GEDAEEMETGNVANLISIFGSSFSGLLRKSPGGGREEEEGEESG 
PEAAEPGC3 CCDKPVLRDMNPWSTAIVAF 


6912 


1 


644 


AKKPVETHSFOMLFTILSTGSALKAQSYEDAYRCI KSS 1 TjLGSI 

sggtdi 1 scfmghn fslp vy kg ei oarnlgmaveawn eegkavw 
gesgelvctkpipcqpthfwndengnkyrkayfskfpgivjahgd 
ycrinpktggivmlgrsdgtlnpngvrfgsseiynives feeve 

DSLCVPQYNKYREERVILFLKMASGHAFQPDLVKRIRDAIRMGL 
S7\RHVPSL3 LETKGIPYTLNGKKVEVAVKQI 2AGKAVEQGGAFS 
NFETLDLYRD1PEL0GF 


6913 


1643 


1558 


KKSHEESHKEELSYGAOASLPLPCSDFR 


6914 


1251 


615 


ELAAECKSAGYPGTLIPYRCDLSNEEDILSMFSAIRSOHSGVD1 
CINNAGLAK FDTLLSGSTSGWKDMFNVNVLALS I CTREAYQSMK 
ERNVDDGK 3 INI NSMSGHRVLPLSVTHFYSATKYAVTALTEGLR 
QELREAQTK1 RATC1 SPGWETOFAFKLHDKDPEKAAATYEOMK 
CLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT 


6915 


254 


652 


GRSLS FKTF LI WVLI S I YQGGILMYGALVLFESEFVH WAI S?T 
AlilLTELLMVAiTVRTVmWLrWVAEFLSIjGCYVSSLAFLNEYFD 
VAFITTVTFLWKVSAITWSCLPLYVLKYLRRKLSPPSYCKLAS 


6916 


254 


652 


GRSLSFKTFLIWVLISIYQGGILWYGALVLFESEFVHWAISFT 
AilLTELL^ALTVRTWTWLMWAJEFLSLGCYVSSLAFLNEYFD 
VAFITTVTFLWKVSAITWSCLPLYVLKYLRRKLSPPSYCKIiAS 


6917 


254 


652 


GRSLSFKTFLIWVTjISIYOGGILMYGALVLFESEFVHWAISFT 
All I LTELLMVALT VRTW H WLMVVAE FLSLGCYVS S LA FLNE Y FD 
VAFI TTVTFLWKVSAI T WS CLPLYVLKYLRPvKLS PPS YCKLAS 


6918 


28 


921 


PEAGTRSWREPDPEDLRRFLLSAACRSFPQWLPGGGGGQVSSCS 
DTDVPYLLLAVKSEPGRFAERQAVRETWGSPAPGIRLLFLLGSP 
VGEAGPDLDSLVAWESRRYSDLLLWDFLDVPFNQTLKDLLLLAW 
LGFJ4CPTVSFVLRA0DDAFVHTPALLAHLRALPPASARSLYLGE 
VFTQAMPLRKPGGPFYVPESFFEGGYPAYASGGGYVIAGRLAPW 
LLRAAARVAPFPFEDVYTGLCIRALGLVPOAHPGFLTAWPADRT 
ADHCAFRNLLLVRPLGPOASIRLWKQLQDPRLQC 


6919 


850 


41 


OGR^ELSGS V } CPF I OOEPKEMLTLSEYHERVRSCGOOLOOLOA 
ELDKLHKEV^VVRAANSERVAKLVFQRLNEDFVRKPDYALSSVG 
AS I DLQXTSK 7 V ADRNTA Y F WNR FS FWNYAR PPTVI LE PHVFPG 
NCWAFEGDOGCWIOLPGRVOLSDITLQHPPPSVEHTGGANSAP 
RDFAVFFLLS 7 FTHQGLQVYDETEVSLGKFTFDVEKSE I QTFKL 
ONDPPAAFPrvKlQILSNWGHPRFTCLYRVRAHGVRTSEGAEGS 
AOGPH 


6920 


1418 


591 


eaogpskvhl: ;_kkkk 


6921 


2 


1712 


MNATRSEEQF}-: VI NHAEQTLRKMENYLKEKQLCDVLLI AGHLR I 
PAHRLVLSAVi DY FAAMFTNDVLEAKQEEVRMEGVDPNALNS LV 
0YAYTGVLQLKEDT3ESL1AAACLLQLTQVIDVCSNFLIKQLHP 
SNCLGIRS FGDAOGCTELLNVAHK YTMEHFI EVI KNQE FLLL PA 
NE I SKLLCSBr 3 NVPDEETI FHALMQWVGHDVQNROGELGMLLS 
Y 1 RLPLLP PQ>- -ADIjETS SMFTGDL^CQlUjIJiEAMKYHLLPERK 

smmos prtkpr kst vgaly avggmdamkgttt i ekydlrtnswl 
h 1 gtmngrrlc fgvavi dnklywggrdglktlntvecfnpvgk 
iwtvmppmsth:-.hglgvatlegpmyavggkdgwsylntverwdp 
egrownyvasmstprstvgwalnnklyaiggrdgssclksmey 
fdphtnkwslc/^mskrrggvgvatyngflywgghdapasnhc 

SRLSDCVERY r PKGDSWSTVAPI>SVPRDAVAVCPLGDKLYVVGG 
YDGHTYLNTVr 5 YDAQRNEWKEEVPVNIGRAGACWWKL? 


6922 


1075 


365 


LTPPAGIRHEVK DREREREREREREKFPLDSTGSELKQNIHSIT 
GLP PAMQKW Y KGLAPEDKTLREI KVTSGAKI MGGGSTINDVLA 
VNTPKDAAQO^-KAEENKKEFLCROKQHRKVLDKGKPEDVl^PSV 
KGAQERLPTVl 1 SGMYWKSGGKVRLTFKLEQDQLWIGTKERTEK 
LPMGSIKNWKLPIEGHEDYHMMAFQLGF/TEASYYWViTfVPTOY 
VDA J. KDTVLjG K w 0 Y F 


6923 


2469 


1660 


LGLFCILPID':iCAVL.ERDTLSIRESRLFGAWRWAEAECQRQQ 
LPVTFGNKQK V1G KALS LI RF PLMT I BEFAAGPAQSG3 LSDRE V 
VNLFLHFTVN KK PR VEY I DR PRCCLRGKECCINRFQQVESRWGY 
SGTSDRIRFTV?^kRISIVGFGLYGSIHGPTDYQVNIQIIEYEKK 
OTLGQNDTGFf CDGTANTFRVMFKEPIEI LPNVCYTACATLKGP 
DSHYGTKGLKJ'V VKETPAASKTVFFFFSSPGNNNGTSIEDGQIP 
EI3FYT 


6924 


2210 


1235 


PEERVICFVJti . LTAFxiEGRKGAIiAlv.KrxNPliV5t»l rHLoWtvf 
\ KJDRVKPKRTAS X S PASCHEHPMADDPSKSYKLRFVAEQVSHKPP 
I SCFYCECEEK?- LCVNTHVWTKSKFMGMSVGVSMIGEGVLRLLE 
HGEE YVFTLP £ } ■. Y ARS I LTI PWVELGGKVSINCAKTGYSATVI F 
HTKPFYGGKVK.^VTAEVKHNPTOTIVCKAHGEWNGTLEFTYNNG 
ETKVI DTTTLP V Y p KKI RPLEKQGPMESRNLWRE VTRYLRLGD I 
DAATEQKRHLI E KQRVEERKRENLRTPWKPKYFIQEGDGSGI LQ 
SPLESTLMGLFA'G-SFPV 


6925 


2 


1653 


RGGAAGAAME F " S VIEDKTI ELM CSV PRSLWLGCANLVESMCAL 
SCLQSMPSVRCLOISNGTSSVIVSRKRPSEGNYQKEKDLCIKYF 
DQWSESDQVEF VEHLI SRMCH YQHGH INSYLKPMLQRDF I TALP 
EQGLDHIAEN3 1 S YLDARSLCAAELVCKEWQRV1 S EGMLWKKLI 
ERMVRTDPLWKGLSERRGWDQYLFKNRPTDGPPNSFYRSLYPKI 
TDnTFTTFCTNW^ rnRHVT.OR TDfR^FN^KGVYCLOYDnFKlTSG 

LRDKS i kiwdk: S LECLKVLTGHTGSVLCLQYDERVIVTGSSDS 
TVRVWDVNTGE YLNTLI HHNEAVI.HLRFSNGLMVTCSKDRS I AV 
VJDI^SATDITLknVLVGKRAAVNVVDFDDKYIVSASGDRTIKVW 
STSTCEFVRTLI^GHKRGIACLQYRDRLWSGSSDNTIRLWDIEC 
GACLRVLEGHE E LVRCI RFDNKRI VSG AYDGKI KVWDLQAALDP 
RAPASTLCLR71 7EHSGRVFRLQFDEFQIISSSHDDTILIWDFL 
NVPPSAQNETRS PSRTYTYI SR 


6926 


1 


733 


SGRVAMDGLGLC r P £0G FPAGP PLLP PHMGGH YR DCQS LGAP P L 








DGYPLPTPDTSPLDGVDPDPAFFAAPMPGDCPAAGTySYAOVSD " 
YAG P PEP P AG PMK P RLG PEPAG PS I PGLLAPPSALH VY YGAMGS 
PGAGGGRGFQMQPOHQHQHQHQHHPPGPGCPTPPPEALPCRDGT 
DPSQPAELLGEVDRTEFEQYLHFVCKPEMGLPYOGKDSGVNLPD 
SHGAISSWSDASSAVYYCNYPDV 


6927 


2 


1484 


L7LCGDI01MLAONAKNRAAHLEEFHYOTKED02ILHSLHRESS 
CQG FA WATDLSTDlrE SQhSVS CXCYEAANE I LQFRDLK SQN PEH 
YVOVLKRJ^GN : RA-E1 GVFYMN0AA7U,QSEKLVS KS VSAAEOQLW 
KKSFSCFE KG 7 HNFES I EDATNAALLLCNTGRLMR3 CAQAHCGA 
GDEl.KREFSPEEGLYYNKAIDYYLKALRSLGTRDIHFAVWDSVN 
WELSTTYFTMATI.QCDYAPLSRKAOEOIEKEVSEAMMKSLKYCD 
VDSVSARCPLCQYRAATIHHRLASMYHSCLRNOVGDEHLRKOHR 
VLADLHYSKAAKLFOLLKDAPCELLRVQLERVAFAEFQMTSONS 
NVGKLKTLSGALDIMVRTEHAFQLIQKEL3EEFGQPKSGDAAAA 
ADASPSLNKEEVMKLLSIFSSRLSFLLLOSIKLLSSTKKKTSNN 
IEDDTILKTNKHIYSOIiLRATANKTATLLERINVIVKLLGOLAA 
GSAASSNAVQ 


6928 


1086 


777 


EA3 DIiINNLLOVKMRKRYSVDKTLSHPWLODYOTWLDLREIjECK 
IGERY1THESDDLRWEKYAGEQGLQYPTMLINPSASHSDTPETE 
ETEMKALGERVSIL 


6929 


1749 


607 


RDORGYRDDRS PAREPGDVSARTRSGGGGGRSATTANP P PV PNG 
NLH0HDP0DLRHNGNWVAGRPSCSRGPRRA10KPQPAGGRRSG 
RGPAAGGLCLQP FDGGTC V PEEP PVP PMDWE ALE KHLAGLQFRE 
OEVRNQGOAK'rNSTSAQKKERESlRQKLALGSFFDDGPGIYTSC 
SKSGKPSLSSRLOSGMNLQICFVNDSGSDKDSDADDSKTETSLD 
TPLS PMS KO£ S S Y SDR DTTEEESESLDDMDFLTRQXKI.QAEAKM 
ALAM7iKPMAKM0VEVEKQNRKKSPVADLLPKMPH I SECLMKRSL 
KPTDLRDMT3 GQLQV1 VNDLHSQIESLNEELVCLliLl RDELHTE 
QDAMLVDIEDL»TRHAESCX?KHMAEKMPAK 


6930 


131 


545 


FKD7ANVFVSLFQMRNNFRHYFI EPSQLKLFYDVITWI VTQVAI 
SrTWPFVLLSIKPSLTFYSSWyyCLHILGILVLLLLPVKKrOR 
RKNTHENICLSOSKKFDEGENSLGQNSFSTTNNVCNPNOEIASR 
HSSLKQ 


6931 


2 


6S9 


F^ERLPNRFACLLVASGAAEGVSAQSFIjHCFTMASTAFNLQVAT 
PGGKAMEFVDVTESNAR WVQDFRLKAYAS PAKLES I PGARYKAb 
LIPSCPGALTELASSGSLARILQHFHSESKPICAVGHGVAALCC 
ATNE DR S WV FDS YS LTGPS VCELVRAPG FARLFL WEDFVKDSG 
ACFSASEPCAVHWLDRHLVTGQNASSTVPAVQNIiLFLCGSRK 


6932 


2 


1131 


FVDS PGQGEOAEEE EGG IQMNS RMRAHS PAEGAS VES £ S PGPKK 
SDMCEGCRSLAAGKPGY I SHDKSTS I KWSHQHPSH POUFS 1 VR 
QACVRSLS CE VCPGREGPI F FGDEQHGFVFSHT FF 1 KJDSLARGF 
ORWYSIITIKiMDRlYLlNSWPFLljGKVRGIIDEWGKALKVFEA 
EQFGCPQRAORKNTAFTPFLHORNGNAARSLTSLTSDDNLWACL 
HTSFAWLLXACGSRLTEXLbEGAPTEDTLVQMEKLADLEEESES 
WDNSEAEEEEKAPVLPESTEGRELTQGPAESSSLSGCGSWQPRK 
LPVFKSLRRKRQVGGRGTAHHELRRRANHGLCLPTRLASGPSTL 
KTLOEVTDSLLGGWjuMAQGVGGI I 


6933 


1431 


890 


SLNLHCTLPPPPHOYPAGYPSDKEGKKPKGOSKKQPSGTTKRPI 
S DDD CP SAS KVYKA S DS AEAI E AFQLTPQQQHLI REDCQNQKLW 
DE VLSHLVEG PNFLKKIiEQS FMCVCCQELVYQP VTTECFHNVCK 
DCLORSFKAOVFSCPACRHDLGQNYIMIPKEILOTLLDLFFPGY 
SKGR 


6934 


3030 


2588 


DRDHSQCGG J RRVA1ARVSSVKLJSKAKIRTVKMTF1 1 VliAFIV 
CWTPFFFVOMWSVWDANAPKEASAFIIVMLIiASIjNSCCI^PWIYM 
LFTGK LFHELVQR F LCCS AS Y1>KGRRIX3ET S AS KKS NS SS FVLS 
HRSSSORSCSOPSTA 




686 


543 


•JSALYVAGGNDGTSCLNSVERySPKAGAWESVAPMNlRRSTKDL 
VAMDGW LYAVGGNDGS SS LNS I EKYNPRTNKWVAAS CM F7RRSS 
VGVAVLELLNFPPPSSPTLSVSSTSL 


6936 


1347 


567 


K SHRROFLSRALLEFFGKSHPPPHRLFRKSLNVGLHYSHI PFLT 
TCLKFLRKRLQKGEVGLSVETSKPQVPVGGLSRKKVPOEPKATV 
KEKRLOEAOLyKEEGNOPYREGKYRDAVSRYHRALLOLRGLDPS 
LFSPLPNLGPQGPALTPEQENILHTTQTDCYNNLiAACLLOMEPV 
KYERVREYSQKVLEROPDNAKALYRAGVAFFHLCDYDOARHYLL 
7J\VNR0PKDANVRRYL0LT0SELSSYERKEKQLYLGMFt 


6937 


1 


727 


AVEFRCCPGRDPACFARGWRLDRVYGTCKCDQACRFTGDCCFDY 
DRACPARPCFVGEWSPWSGCADQCKPTTRVRRRSVQQEPQNGGA 
PCPPLEERAGCXEYSTPQGODCGHTYVPAFITTSAraKERTROA 
TSPHWSTHT3DAGYCMEFKTESLTPHCALENRPLTRWM0YX-REG 
YTVCVDCOPPAMNSVSLRCSGDGLDSDGNQTIiHWOAIGNPRCOG 
TWKKVRRVDOCSCPAVHSFIFI 


6938 


3 


719 


KSRKLEIiAERVDTDFMQLKKRRQSSEKENDSGTLDTVGAVWDH 
EGNVAAAVSSGGLALKHPGRVGOAALYGCGCWAENTGAHiv'PYST 
AVSTSGCGEHLVRT1 1»AR E CS HAliOAEDAHQAbLETM QN KF 3 SS 
PFLASEDGVLGGV3VLRSCRCSAEPDSS0NK0TLLVEFLWSHTT 
ESMCVGYMSAQDGKAKTH1SRLPPGAVAGQSVA3EGGVCRLGEP 
S ELTLOAECEASQRHFRT 


6939 


3 


810 


KVTAPRRPQRYSSGHGSDNSSVLSGELPPAMGRTALFKHSGGSS 
G Y E S IjRR DS E ATG S AS S APDS MS E SGAAS PG ARTRS LK S P K KRA 
TGbORRRIil PAPLFDTTALGRKPS LPGOW\T)LPPPLAGS LKEPF 
EI KVYEI DDVERLORPRPTPREAPTCXSIiACVSTRLRLAERRQOR 
LREVOAKHKHLCEELAETOGRLMLEPGRWLEQFEVDPELEPESA 
E Y LAALE RATAALEQ CVNLCKAHVKM VT C FD I S VAAS AA3 PGPQ 
EVDV 


6940 


1188 


496 


G>CHAA0PLRKRSRCATPPRGDFCGGTERA1D0ASFTTSMEWDTQ 
W KG S S P LG PAGLG AEE PAAG PQL P S WLQPERCAVFQ CAC CHA.V 
IUADSVKLAWDLSRSLGAVVFSRVTNNVVLEAPFI..VG3 EGS LKGS 
TYTCLLFCGSCGIPVGFHLYSTHAALAALRGHFCLSSDKIWCYLL 
KTKAIVNASEMDIQNVPLSEKIAELXEKIVLTHNPXKSLMKILS 
EVTPDQSKPEH 


6941 


1 


713 


SLS RADSDPHGPHTCGHVLNVI 2 GSNVLALAEAQROAEAl-G YQA 
VYL S AAMQGD VKSMAQ FYGLIiAHVARTR LT P SMAGAS VEEDACL 
KEliAAElOIPDLOLEEAIiETMAWGRGPVCLIAGGEPTVOLOGSG 
KGGRNOELALRVGAELRRW PLG P 1 DVbFLSGGTDGODG PTEAAG 
AWVTPELASQARAEGLDIATFIiAHNDSHTPFCCLQGGAHLLKTG 
KTGTNVMDTHLLFLR PP: 


6942 


1 


246 


GDYVERYDPKTDTWTMGAPLSMPTNAVGGCLLGDRLYADGGYDG 
CTYLNTMESYDPQTNEWTQMASLN3GRAGACVWIKOP 


6943 


1 


739 


PMATGDGAKTLAI HVKALTADS 3 R 3 TWKATbPASS FRLSKLRLG 
K S PAGGS 1 TE7LVQGDKTE YLLTALEPKPTY 3 1 CMVTM ETTNA Y 

vadetpvcakaetadsygptttlnoeqnagpmaslpiag: I GGA 

VALV FLFLVLGAI CW YVHQAGELLTRERAYNRGSRKKDDYMESG 
TKKDNS 3LE 3 RGPGLQMLP INPYRAKEEYWHTI F PSKGS SLCK 
ATHT 3 GYGTTRGYRDGG 3 PDIDYSYT 


6944 


960 


156 


VAN 3 LLNGVKYESELTGSSERAEQPLSVGRLCSTICNMPKALRT 
LCVNK FLGWLS FEGMLLF YTDFMGE WFGGDFKAPHTSEA YQKY 
NSGVTMGCWGMC3 YAFSAAFYSAILEKIjEEFLSVRTLYFIAYLA 
FGLGTGLATLSRNLYWLSLC1TYGILFSTLCTLPYSLLCDYYQ 
S K KFAGSSADGTRRGMGVDI SLLS CQYFLAQ 3LVSLVLG PLTSA 
VGSANGVMYFSSLVSFLGCLYSSLFVIYE3PPSDAADEEHRPLL 
LNV 





6945 


2067 


179 


EGEDRGLPRTMGAALGTGTRLAPWPGRACGALPRWTPTAPAQGC 
HSKPC-PARPVPLKKRGYDVTRNPHLNKGMAFTLEERLQliGIHGb 
I-PPCFLSQDVQLLRIMRYY ERCQSDLDKY 3 1 LMTLQDRNEKLFY 
RVLTSDVEKFMPIVYTPTVGLACQHYGLTFRRPRGLFIT1HDKG 
HLATMU3 S WPEDN I KAVWTDGER1 LGLGDLGCYGMGI PVGKLA 
LYTACGGVNPQQCLPVLLDVGTNNEELLRDPLYIGLKHQRVHGK 
A YDDL LDEFMOA VTDK FG I NCLJQFEDFANANAFRLLNKYRNKY 

LPIi IML/UJ-VVj I f\J v t\V f\vz J. ±jf\t\±jt\A. X rVWrvUoI\n V r or \JO>*vsE«/»i-u*J 

G\ IAHLLVMALE\KEGVPKA\ EATRKI W\MVDF\KGLI VQGRDH 
LNHEKEMFA0D\HPEVNSLEEWRLVKPTA11GVAAIAEA\FTE 
02LRDMASFH£RP\ 1 1 FALSNPTS KAECTA\EKCYRVTEGPRGF 
FAS\GSPF*GVLIWEMGXTFIPGGRGNNA*RVPRGWOLGVHSPG 
GDPGHIP\DEIFLPDSRAKLPQEVSEQHLSOGRLYP\PLST\lR 
NVFLRIAIKVFD*GYKHNLV\SYYPEPKD\KEAFCKIPGSYTPD 
Y DS FYT / VDS Y I WAQGKAMNVQTV 


6946 


133 


2551 


S CEYSG2 TVAPGDPCPGVAHLbAPSMASDTFESLMALCTDFCLR 
NLDGTLG YLLDKETLRLH P DI FLPSE I \ CDRLVNEY V EliVNAAC 
NF\EPHE\SFFNPLFRDPRKQPASRRIHL\RED\LVQD\QD\LE 
AIRKODL\VEL\YLTN\CEKLSAKSLQTLRSFSHTLGVP*AFFG 
C \TN I L.LLR KENPG GL/ CE DEYLFNPTCQVLV KDFT F EGFS R LR 
F \ LKLGRM1D WVPVES\LLRPLN£LAALDLSG I QTSDAA\ FLTQ 
W KDS L\ VS bVL\ YN MDLSD DJ1 1 R \ V I VOLHKLRHLD 2 S RDRLSS 
YYKFKLTREVLSLFVQKLGNLMS LD I SG\HM I LENCS ISKIGKR 
EAGQTSI\EPSK\SSIIPFRGFEGGPLQF\LGVF»GIFCGRI*TH 
I PAYKVSGDKNEEOVLNAI EAYTEHRPEITSRAINLLFDIAR1 E 
RCNOLbRAbKLVITALKCHKYDRNIQVTGSAALFYLTNSEYRSE 

OYRRVNELLLS ILNPTKODES I QRI AVHLCNALVCQVDNDHK3A 
VG KMG F WTMLKLTQKKLLD KTCDQ VMEFS W\ SAL WN I TDETPD 
NCEMFLNFNGMKLFLDCLNEFPEKQELHRNMLGIiLGNVAEVKEL 
R PQLMTSQFI S VFSNLLES KADGIEVS YKACG VXSH I MFDG PEA 
WGVCEPOREEVEERMWAA20SWDINSRRNIKYRSFEPI1.RLLPQ 
G2 SPVSCHWATWALYNLVS VY PDKYCPbLl KEGGMPLLRDI 1KM 
ATAROET KEMARKVI EHCSNFKEENMDTSR 


6947 


2 


1682 


rSVSTIPRGIASARPQSRSWRCCPVWRRSPGRARGRGLKMLNVP 
SOS F P A? RSQQRVASGGR S KV P LKQGRS LMDW 1 RLTKSG KDVTG 
LKGRLI EVTEEELKKWKKDDCW1 C1RGFVYNVSPYMEYHPGGE 
DELMRAAGSEGTELFDOVHRWVNYESMLKECLVGRMAIKPAVLK 
D YREEEKKVLNGMbPKSQVTDTLAKEGPSYF SYDW FQTDSLVTI 

/FHTY*TFr?VDFI?I.KITJc:* c J<;F* FTjVSRNNY *GLIj2S YTYW/R*A 
MRFRXI FbCGL/CESVGKlElVLiQKKENTSWDFLGHPLKNHNSL 
IPRKDTGLYYRKCQLISKEDVT«DTRLFCLMLPPSTHLQVPIGQ 
HVYLXLP I TGTEI VKP YTP VS GSLLS E FKEPVLPNNKYI YFLI K 
IYPTGLFTPELDRLQIGDFVSVSSPEGNFKISKFQELEDLFLLA 
AGTGFTPMVKILNY ALTDI P S LR KVKLMF FN KT EDD 1 1 WRS QbE 
KLAFKDKRLDVEFVIjSAP 1 SEWNGKQGH1SPALLSEFLKRNLDK 
SKVhVCJ CGPVPFTE0GVRLLHDLNFSKNE1KSFTA 


6948 


104 


58 


PDGAHSFFPDEYFTCSSLCLSCGVGCKKSMNHGKEGVPHEAKSR 
CRYSH0yDNRVYTCKACYERGEEVSWPXTSASTDSPWJ4GLAKY 
AWSGYVI ECPNCG WYRSRQ YWFGNQDPVDTWRTE I VH VWPGT 
DGFI/KDI^^AAQRLLDGMKFMAQSVSELSLGPTKAVTSWliTDOI 
APAYWRPNSQILSCNKCATSFKDNDTKHHCRACGEGFCDSCSSK 
TRPVPERGWGPAPVRVCDKCYEAR/TRPVSCYRGTSGR*RRRRT 
OETVE 


6949 


152 


4656 


GLRLCLSRPLTRPGDDSVGGSAMLASGAGGVGGGGGGKIRTRRCH 
OGPIKPYQG^RQQHQGILSRVTESVKNIVPGWLQRYFNXNEDVC 








SCSTDTS EVPRWPENKEDHLVYAX'EESSN 3 TDGRITPEPAVSNT 
EE PS TTSTAS T \ YPDVLTR VSLY R S HLNFSMLES PALHCQPSTS 
SAFPIGSSGFSLVKEI KDST5QHDDDNI STTSGFSSRASDKDIT 
VSKNTSLPPLWSPEAERSHSLSOHTATSSKKPAFNLSAFGTLSP 
SLGN SSI LKTSQLGDS PF Y PGKT7 YGGAAAAVRQSKLRNTP YQA 
P\TR R Q*M KA KQLSAQS YGV7SS TAR R I LOS L E KMSSPLADAKR I P 
S1VSSPLNSPLDRSGIDITDFQAKREKVDSQYPPV0RLMTPKPV 
SIATNRSVYFKPSLTPSGEFRKTNORIDKKCSTGYEKNMTPGQN 
REQRES G FS YPNFS LPAANGLSS G VGGGGG KMRRERHAFVAS KP 
LEEEEMEGPVLPKISLFITSSSLPTFNFSSPEITTSSPSPINSS 
OALTNK VCMTS PSS TGS P MFKFSS P I VKSTEANVLPPS S IG FTF 

svpvaktaelsgssstlepiisssv^hvttvnstnckktppedc 
egpfrpae1lkegsvld1lkspgfaspk1lsvaaqptatspwy 
tr pais s fsssgxgfges lkagss wqcdtcllqnkvtdnkci ac 
qaaklsfkdtakqtgietpnxsgkttlsa5gtgfgdkfkpvigt 
wdcd7clvqnkpeaikcvacetpkpgtcvxraltltvvsesaet 
mtas sssctvttgtlgfgdkfkrf3 gswecsvccvsnnaednkc 
vscmsekpgssvptsssstvpvslpsggslglekfkkpegiwdc 
elclv qn kadst kc lace s akpgt ks g fkg fdts £ s s sn saas s 
sfkfgvsssssgpsqtltstgnfkfgdqggfk3gvssdsgyinp 
msegf*fskhivgfkfgvsseskpeevk:<dgkndnfkfglsfgl 
snpvfltpfqfgvsnlgoeekkeellksscagfrfgtgvinstr 
vpant 1 vtsenkss fnlgtz etksvsvaplkcqtseakkeempa 
tkggfsfgnvepaslpsasvfvlgkteeko0epvtstslvfgeg 
kltmke? kc\qp v f s fge forot kden s s k st fs fsmt kp s e ke 

b>c,vrJ\KJ\ i r AFGAQTNTTADGGAAKi- > DJj£> Y LNNbSSbSSTPA TS 
AGGG\IFGSSTSSSNPPVATFVFGC£SNPGSSS\AFGNTAESST 
SOSLLFSODSKLATTSSTGTAVTPFVFGPGASSNNTTTSGFGFG 
ATTTSSSAGSSFVFGTGPSAPSASFAFGANCTPTFGQSQGASQP 
NPPGFGS3SSSTALFPTGS0PAPPTFGTVSSSSQPPVFGQQPSQ 
SAFGSGTTPNSSSAFQFGSSTTNFMFTNNSPSGVFTFGANSSTP 
AASAQPSGSGGPPFNQSPAAFTVGSNGKKVFSSSGTSFSGRKIK 
TAVRRRK 


6950 


2585 


411 


PRPGSRSGLCRRAGERGAVRAGGLSnRTRAE* IMDELHYQDTDS 
DVPEQRDSKCKVKWTHEEDEQLRALVRQFGOODWKFLASHFPNR 
TD0OC0YRWLRVLNPDLVKGPWTKEEDOKVIELVKKYGTKQWTL 
I AKHLKG R LGKQCR ERWHNKLN P E V K KS CWT E EEDR 1 1 CE AHKV 
LGNRWAE I AKMLPGRTDNAVKNHKNSTI KRKVDTGGFLSESKDC 
KPPVYLLLELEDKDGLQSAQPTEGOGSLLTNWPSVPPTIKEEEN 
SEEELAAATTSKEQEPIGTDLDA\rRTPEPLEEFPKREDQEGSPP 
ETSLPYKWVEAANL LI PA VGSS LSEALDLI ESDPDAWCDLSKF 
DLPEEPSAEDSINNSLVOLOASHOOCVLPPRQPSA\LVPSVTEY 
RLDGHTI £ DLSRSSRGELI F 1 SPSTE VGGSG I GTPPSVLKRQRK 
R R VALS P V 7EHSTS l*S FLDS CNS LTF KS TP VKTLPFS PS QFLN F 
VINKQDTLELESPSLTSTPVCSQKW\^TTPLKRDKrPLHQKHAAF 
VTPDOKYSMDWTPHTPTPFKNALEKYGPLKPLPQTPELEEDLXE 
VLRSEAG1 ELI IEDD IRPSKQKRKPGLRRS PI KKVRKSLALDIV 
DEDMKXMKS TLPKS LS LPTTAPSNS S SLTLS G I KEDNS LLNQGF 
LQAK P E KAA VAQKPRS H FTTPAPMS S A WKTVACGGTRDQL FMQE 
KARQLLGRLKPSHTSRTLILS 


6951 


1940 


239 


AGPDDTMKRSLQALYCQLLSFLLI LALTEALAFAIQEPS PRESL 
OVLPSGTPFGTMVTAPHSSTRHTSWMLTPNFDGPPSOAAAPMA 
TPTPRAEGKPPT\TPSPPSLRQ* PPP ILKAP/SSTGFAPAAMAT 
TS S K PEG R PRGQAAPT I LLTKP PG ATS RPTTA P PRTTTRRPPRP 
PGSSRKGAGNSSRPVPPAPGGHSRSKEGQRGRNPSSTPLGQKRP 
LGKI FQiyKGNFTGSVEPEPSTLTPRTPLWGYSSSPQPQTVAAT 
TVPSNT^WA.PTTTST.rJPAKnKPfP ,R P Zl & OH(^ f; C;T FT crnrzr^T Drift 

TAASGAPVSP/PSCPSAFSAPPPR*PTGWPQP**LLAYCYP\CT 
SRPLSTS SGVFTAATGPTPAAFDTS VSA PSQGJ PQGAS TTPCAP 
^hpsrv^f^tt ^hakeftvjjA p<;p*PTc:cp\rt pnwvpcvDnzxT c; 
S TAWS PPG PGSLGQQGTS PMW PRGTNRSTEP PSA* ARW ISFG* S 
WPSACPSPP\LCPADGVLHEEEEEDRQPGEQ?EAYGNNTKKPGT 
T FQQAC \ RGAAPG E I P VPLKPLRTQLSE P RS PANG D YRDTGM V P 
C 


6952 


658 


304 


PESEGESGEMTDRYTI HSQJjEHLQSKYIGTN ATPTPPSGSG \CE 
PTPRLVLLLHGPLRPSOLLRHCGE*EQSASPLOLDGKDASALWT 
ASRQARGELRbCLTTAVRGTSPSVSPVCOSS 


6953 


1512 


349 


NVJGKTRALASGKHVPFGKQTNPNKS /VHCDS*G**RRETTQDES j 

FS PHFRG KMGG W \ KhE K ELENTEQ P VGGN EG * EHE VTGNLNS D 

PLLELCQCPLCQLDCGSREObIAHVYOHTAAWSAKSYM\CPVC 

GRALSSPGSLGRHLLIHSEDCRSNCAVCGARFTSHATFNSEKLP 

EVLNMESLPTVHNEGPSSAEGKDiAFSPPVYPAGILLVCNKCA7\ 

YRKLLEAQTPSVRKWALRRQNEPLEVRLORLERERTAKKSRRDN 

ETPEEREVRRMRDREAKRWRMOETDEQRARRLQRDREAMRLKR 

AIETPEKRQARLIREREAKRLKRRLEKMDKMLRAQFGQDPSAMA 

ALAAtMN Ft QL>P V SGV h LiD SQLLo KMAF E EQN S S S LjH 


6954 


819 


1 


PPPPFI I PSHPREAGT*AG * KRSGDSECSFPVEQ* A* TRAAAQN 
* PQR* RWTEGNS PQASAVATPGQGAS PAAPR Cf P * PSRRHRRLP 
PGARPPAG* AAPAFTKPWLAG PAS A PQPGAAPLS P PAP PLI RTR 
*CAGAAARGRPRRDRSPRPRTPGGCSWSEPRTPPAVSASAQTP£ 
DAG*AGGR*GQRQRPSTGR*PPGVGGAGRSHRREGTIPGNPHPR 
AS*RAGWQR*PGP/REWGL*EPCGEEMSGPGGPGGAFFNQVGSS 
VN0AMSTG1 : 


6955 


1968 


782 


PPGRRQVRAQVAGAPVGHWGTRAROVKTGGRRRARRTMPFLGOD 
WRSPGWS WI KTEDGWKRCESC50KLERENNHCNI SHS 1 1 LNSED
GEIFNNEEHEYASKKJ?KKDHFRNDTNTQSF
ERHGY CTLGEAFN R LDFS S A I QDI RR FNYVVKLLO/L I AKS QLTS 
LSGVAQKNYFNILDKIVQKVLDDHHNPRLIKDLLQDLSSTLCIL 
/N*RSREVCISGKHQYLDLPI RNYSRIiATTATGSSDD* ASE \NG 
LTLSDLPLHMLNN I LYR FSDGWDI I TLGQVTPTLYMLS EDRQLW 
KKLCQYHFAEKQFCRHLILSEKGHIEVIKLMYFALQKHYPAKEOY 
GDTLHFCRHCS ILF WKDSGHPCTAADPDSCFTPVS PQH FI DLFK 
F 


6956


8605 


3839 


QTSTS I FASPTSPPVLGESVLQDNSFDLNNGSDAEQEEMETQSS 
DFPPSLTQPAPD0SST10LHPATSPAVSPTTSPAVSLWSPAAS 
PEIS PEVCPAASTWS PAVFS WS PAS SAVLPAVSLEVPLTAS V 
T S PKAS P VTS PAAAFPTAS PAN KD VS S FLETTADVE E I TGEGLT 
ASGSGDVMRRRIATPEEVRLPLQHGWRREVR1KKGSHRWQGETW 
YYGPCGKRMKQFPEVIKYLSRNVVHSVRREHFSFSPRMPVGDFF 
EERDTPEGLQWVQLSASEI PSRI QAITGKRGRPRNTEKARTKE V 
PKVKRGRGRPPKVKITELLNKTDNRPLKKLEAQETLNEEDKAKI 
AKSKKKMRQKVORGECQTTIQGOARNKRKQETKSLKQKEAKKKS 
3CAEKEKGKTKCEKLKEKVKREKXEKVKMKEKEEVTKAKPACKAD 
KTLATQRRLEERQRQOMILEEMKKPTEDMCLTDHQPLPDFSRVP 
GLTLPSGAFSDCLTIVEFLHSFGKVLGFDPAXDVPSLGVLQEGL 
LCQGDS LGE VQDLLVRLLKAALKDPG FPSY COS LKI LG EKVS E I 
PLTRDNVSEILRCFLMAYGVEPALCDRLRTOPFOAOPPOQKAAV 
LAFliVHE LNG STL1INEID KTLES MS S YRKN KW IVEGR LRRL KT 
VLAKRTGRSEVEMEGPEECLGRRRSSRIKEVTSGMEEEEEEESI 
AA VPGRRGRRDG EVDATAS S I P ELERQ I EKL S KRQL FFR KKLLH 
SSQMLRAVSLGQDRYRRRYWVLPYLAGIFVEGTEGNLVPEEVIK 
KETDS LKVAAHAS LN PAL FS MKMELAGSNTTAS S P ARARG R PRK 
TKPGSMQPRHLKSPVRGQDSEQPQACLQPEAQLHAPAQPQFQLQ 
LQLQSHKGFL50EGSPLSLGQSQHDLSQSAFLSWLSQT0SHSSL 
LSSSVLTPDSSPGKLDPAPSQPPEEPEPDEAESSPDPQALWFNI 
SAQMPCNAAPTPPPAVSEDQPTPSPQOLASSKPMNRPSAANPCS 
PVQFSSTPLAGLAPKRRAGDPGEMPOSPTGLGOPKRRGRPPSKF 
FKQMEQRYLTQl/TAQ P VP PE MCSGW WW 1 RDPEMLD AMLKALK PR 
G I REKALHKHLN KHRDFLQEVCLR PS ADP 1 FEPROLPAFQEG 1 M 
SWSPKEKTYETDLAVLQWVEELEORVIMSDLQIRGWTCPSPDST 
REDLAYCEHLSDSOEDITWRGRGREGLAPQRKTTNPLDLAVMRL 
AALEQNVE R R Y L R E P LWPTHE WLE KALLS TPNG AP EGTTTE I S 
i fc. 1 i FK J K VW 1 JUfc-KCKSAAy VCJjl_JbGQ.Lifc.KS IAWJLKS VNKV T 
CLVCRKGDNDEFLLLCDGCDRGCHIYCKRPKMEAVPEGDWFCTV 
CIAOQVEGEFTCKPGFPKRGQKRKSGYSLNFSEGDGRRRRVLLR 
GRESPAAGPRYSEEGLSPSKRRRLSMRNHHSDLTFCEIILMEME 
SHDAAW PFliE P VN PRLV SG YRR 1 1 KNPMDFSTMRERLljRGGYTS 
SEE FAADALLVFDNCOT FNEDDS E VG KAGH I MR R F FE \ S R WE E F 
YOGKQGCSVROGRWGVTLWHLPPTKQTKTCHFHLLMLPWVQTQV 
RYNPDF 


6957


82 


3514 


KL» 1 V AMPE PT KK E EN E V P AP AP PPEE T'SKEKB AGTT PA KDWTLV 
ETPPGEEOAKQN.ANSOLS ILFIEKPOGGTVKVGEDJ TFI AKVKA 
EDLSEKFTINGSRKWMDliASKAGKMLObKETFERKSRVYTFEMQ 
1 1 KAKDNFAGNYR CEVTYKDKFDS CS FDLEVHESTGTT PNIDIR 
SAFKRSGEGQEDAGELDFSGLLKRREVKC^EEEPOVDVWELLKN 
TKPSEYEK1AFOYESPTCSGMLKRLKRSIREEKKSAAFAKILDP 
VYQVT3KGGRV1?FWELADPKLEVKWNKNGQELRPSTKYIFEDTR 
CQSI1.NIDNC0MTDDSEYYVTAGDEKCSTELLVREPPIMVTKCL 
EDTTDYCGERVE LECEVSEDDAQVKWFKNGEEI I LVOTRYRI RV 
EGKKH ILI I EG ATJ<ADAADYS VMTTGGOSSAKLS VDLKPLKI LT 
PLTDQTVNLGKEI CLKCE I SENI PGKWTKNGLPVOESDRLKWH 
KGR I H KLVIDHALTEDEGDYVFAPDA YNVTLPAKVHVI DPPX 1 I 
LLX3LDADNTVTV I AGN KLRLE I P I SGE P PPKAMWSRG DXAI MEG 
SGRIRTESYPDSSTLVIDIAERDDSGVYHINLKNEAGEAHASIK 
VKWDFPDPPVAPTVTEVGDDWCIMKWEPPAYDGGSPILGYFIE 
RK KKQSSRWMRLNFDLCKETTFEPKKtf I EGVAYEVR 1 FAVNA\ I 
GISKPSMPSRPFVPIAVTSPPTLLTVDSVTDTTVTKRWRPPDHI 
GAAGUDG YVLEY CFEGSTSAXQSDENGEAAYDLPAEDW I VANKD 

Ij ± UJ\ 1 IKT ill KyLtf 1 Ut\l\±]? VK VXSAVJvAAljHoIir'JV.I I vWrliiVAb 

1 1 EPPKIHSPKKLKQTYI RRVGDRV I LVI PFOGKPR PELTWKKD 
GAEIDKNQINIRKSETDTI 1 FIRKAERSHSGKYDLQVKVDKFVE 
TASIDIRIIDRPGPPOIVKIEDVTflGRNVALTWTPPKDDGNAAlT 
GYTIQKADKKSMEWLRVIEHIIEPVFHTELVIGNEYyFRVFSEN 
MCGLSEDATMTKESAVlARDGKIYKNPVYEDFDFSEAPKFTQPli 
VT^RLCHSGYMATLNCSVRGNPKPKITWMKNKVAIVDDPRYRMFS 
NOGVCTLEIRKPSPYDGGTYCCKAVNDLGTVEIECKLEVKVIAQ 


6958 


274 


1663 


PRTSRVKTEGSQGS SAMDFS VKVDI E KEVTCP I CLELLTEPLSL 
DCGHSFCQACITAXI KESVI ISRGESSCPVCQTRFQPGNLRPNR 
HIJuNIlVERVKEVKMSP0EGQKjy)VCEKHGKKMIFCKELX3KVlC 
WVCELSQEHOGKOT FR I NE W JCECQE KLQ VALQRL I KE NQEAEK 
LEDDIRQERTAWKNYIQIERQKILKGFNEMRVILDNEEQRELQK 
LEEGEVNVLDNLAAATDQliVQQRQDAS TLI SDLQRRLRGSSVEM 
LQDVIDVMKRSESV1TLKKPKSVSKKLKSVFRVPDLSGMLQVLKE 
LTDVQYYWVDVMLN PGSATSNVAI SVDQROVKTVRTCTFKNSNP 
CDFSAFGVFGC0YFSSGKYYWEVI)VSGKIAWIIiGVHSX3SSlWK 
RKSSGFAFDPSVNYSKVYSRYRPQYGYVJVIGLQNTCEYNAFEDS 
SSSDPKVLTbFMAV\LPWU5FS 




6959


1


146$ 


SLVHWEFGRGIEDFPYLFFOLTHCOQRICSVTOAGVOWCDKSS 
LOPQTPGLNQSSHLSLLSSRDYRMbSSFNEWFWODRFWLPPN\'T 
WTELEDRDGRVYPHPODLLAALPLALVLIAMRLAFERFIGLPLS 
RWLGVRDOTRRQVKPNATLEKHFLTEGHRPKEFQLSIjLAAOCGIj 
J LAjKllljRVir KKKRNyUKPyuJ KKF CSASWRrijA' YLSSFVGGLS V 
LYHESWLWAPVMCWDRYPNQLTLSCPAADSEA\£1YWWYLLELG 
FYLSLLIRLPFDVKRKGGGPSSIKPRPHYDPPSTMDFKEQV1H 
HFVAVI hMTFS YSANLLR 1GSLVLLLHJDSSDYXLEACKMVNYMC 
YQQVCDALFLIFSFVFFYTRIiVLFPTQILYTTYYESISNRGPFF 
GYYFFMGLLMLLOLLHVFWSCLILRMIiYSFMKKGQMEKDIRSDV 
EESDS SEEAAAAQEPLQL.KNGTAGGPRPAPTDGPR SRVAGRLTK 
RHTTA? 


6960 


387 


2068 


AKWAREKEMQEF\TRS FF\RGRPDLSTLTHS I VRRRY LAHSGRS 
HLEPEEKQAL KR L VEEE PLKMQ VDEAASR EDKLDL TK KGKR P PT 
PCSDPERKRFRFNSESESGSEASSPDYFGPPAKNCVASRSHTKF 
KEENPRRA\SKAVEESSDEERQRDLPAORGEE££EEEEKGYKGK 
TR KKP WKKQAPG KAS VS RKQAR EE S EES EAEP VOK TAK KVEGN . 
KGTKS LKES EOESE EE I LAQ K K EQR EEE VEE EE KE EDEEKGDWK 
PRTRSNGRRKS AREER SCKQKSQAXR1XGDSDS EE EQKEAASSG 
DDSGRDRE P P VQRKS E DRTQLKGGKRL SGS SED E E DSG KGEPTA 
KGSRKMARLGSTSGEESDLEREVSDSEAGGGPOGERKNRSSKKS 
SR KGUTRSSSSSSDGSPEhKGG KAGSGRRGEDHFAVMRLKRYIR 
ACG AH RN Y K KLLGS CCS HKERLS I LRAELEALG M KGT PS LGKCR 
ALKEQREEAAEVASLDVANI I SGSGRPRRRTAWN PLGEAAPPGE 
LYRRTLDSDEERPRFAPPDWSHMRGIISSDGESN 


6961 


340 


1646 


RFVJSSPTMKPN FSLRLR I FN1JKCMGI PYLSKURADKMRRLGDFL 
KQESFDLALLEEVWSEODFOYLRQKLSPTYPAAHHFRSGIIGSG 
LCVFS KH P I QELTQH1 Y TLNG YP YMI HHGDW FSG KAVGLLVLHL 
SGMVLNAYVTHLI^EYNRQKDIYLAHRVAQAWELAGFIHHTSKK 
ADWLLCGPLNMHPEDLGCCLLKEWTGLHDAYLETRDFKGSEEG 
NTMVP KNC YVS QQELK P F PFG VR X D YVL YKAVS G FY 1 S CKS FET 
TTGFDPKRGTPLSDHEALMAIXFVRHSPPQONPSSTHGPXAERS 
PL/MCVCLKEALDGSLGLGMA\OARWWA\TFA\ S YV1 GLGlAbL 
luALLCVliAAGGGAGEAA 1 LLWTPS VGLVLWAGAFY LFHVQEVNG 
LYRAQAELOHVLGRAREAQDLGPEPQLYALL\LGCOEGDRTKEQ 


6962 


340 


1646 


RP WSS PTMKPNFS LRLR I FNLNCWG I PYLS KHRADRWRRLGDFL 
NQES PDLALLEE VWSEQDFQ YLRQKLS PTYPAAHH PR SGI I GSG 
LCVFS KKPIQELTQHIYTliNGYPYMIHHGDWFSGK^VGLLVLHL 
SGMVLNA YVTHLHAE YK RQKD I YLAH R VAQAHELAQF I HHTS KK 
ADWLLCGDLNMHPEDLGCCLLKEWTGLHDAYLETRDFKGSEEG 
NTMVPKNCYVSQOELKPFPFGVRIDYVLYKAVSGFYISCKSFET 
TTGFDPHRGTPLSDHEALMATLFVRHSPPQQNPSSTHGP\AERS 
PL/MC VCLKEALDG S LGLGMA\QAR W WA\TFA\ £ YVI GLG L\ LL 
bALLCVLAAGGGAGE AA I LLWTPS VGLVLWAGA FY LFHVQE VNG 
DYRA0AEL0HVLGRAREAQDLGPEPQLYALL\LGOQEGDRTKEO 


6963 


374 


2618 


RVTPL I UCLLKKPKTAENQKASEENE I TQPGGSS AX PGLPCLNF 
EAVLSPDPALIHSTHSLTNSHAHTGSSDCDISCKGMTERIHSIN 
LHKFSNSVLETLNEQRNRGHFCDVTVRIHGSMLRAORCVLAAGS 
PFFQDKLLLGYSDIEIPSWSVQSVQKLIDFMYSGVLRVSOSEA 
LQI LTAAS ILQI KTVI DECTRI VSQNVGDVFPG I QDSGQDT PRG 
TPESGTSGOSSDTESGYLQSHPQHSVDRIYSALYACSMQNGSGE 
RSFYSGAWSHHETALGLPRDHHMEDPSWITR1KEKSQ0MERYL 
STTPETTHCRKQPRPVRIQTJbVGNIHIKQEMEDDyDYYGQQRVQ 
ILERNESEECTEDTDQAEGTESEPKGESFDSGVSSSIGTEPDSV 
EQQFGPGAARDSOAEPTQPEQAABAPAEGGPOTI^OIETGASSPE 
RSNEVEMDS TV1 TVSNSSDKSVLQQPS VNTSI GO P !• PSTQLYLR 
QTETLTSNLRMF LTLTS NTQV I G TAGNTYLPALF7TQ PAG SG P K 
PFLFSLPOFbAGQQTOFVTVSQPGLSTFTAOLFAPQFlASSAGH 
STASGOGEKKPYECTLCNKTFTAKQNYVKHWFVHTGEKPKOCSI 
C WR S F S bKD YL I K \ H MVTHTG VRA YQCS I CNKR FTQKS S LNVHM 
RLHRGEKSYECYICKKKFSHKTLLERHVALHSASNGTPPAGTPP 
GARAGPPGWACTEGTTYVCSVCPAKFDQIEQFNDHKRKHVSDG 


6964 


3 


178


SGR P FFFFFSNTDVYF I K KVTNRWTAGSS YKMTRMKS I G K I LLL 
01FIG\NCSMFVLVI 


6965 


757 


208 


NVF1 E PR ICG FMKTSAH PGQKHPDFSMGLLFPLLAALE VCSCGS 
SGSLGYMLPCNH\GLLGRNTLVLLGQMRRISPFLCLKDRSDFRF 
PQEKVEVSCLOKA\OAMSFLYDVLQOVFNFSHKALL\CCMEHDL 
PGPTPHFTSSAAGTPGDLLGAGDGRRRSWGQWV1EGSTLALRRY 
FQESISTLE 






loo/ 


ALRSRGTATACRLDNKESESWGALLSGERLDTWICSLLGSLMVG 
LSGV FPLLVI PLEMGTMLRS EAGAWRLKQLLS FA I jGGbLGNVFL 
HLLPEAWAYTCSASPGGEGQSLQQQQQLGLWVIAGILTFLALEK 

GP\t>I KVSGY LNbLANTl DNFTHGLAVAASFLVS KK1GLLTTMA 
ILL1IEI PHEVGDFAI LLRAG FDRWSAAKLQLSTALGGLLG AGFA 
ICTCSPKGVEETAAWVLPFTSGGFLYIAbVNVLPDLLEEEDPW 


6967 




633 


G FbPFKYW 3 LD1>SAS SRMETDCNPMELSSMSGFEEGSELNG FEG 
TDMKDMKIj E AE AWNDV L FA VNNM F VS KS LRCADD VAY I NVETK 
ERNR Y CLELTE AGLKWG Y AFDQVDDHLQTPY H ETVYSLLDTL\ 
S PAYREAFGKR \LLQRLEA1>KRDGQS 


6968


3 


2265 


RGGGGGRGG PG ARE RE R PG E PERTM E AAAGGRG CFQPHPG LQKT 
LEQFHL5SKSSLGGPAAFSARWAQEAYKKESAKEAGAAAVPAPV 
P AATE P P P VLH LPAI QP P P ? VL PGP FFMPSDR S TERCET VLEGE 
T2 SCFWGGEKRLCLPQ1 L»N S VLRD FS LQQ I NAVCDELH I YCSR 
CTADQLEILKVMGILPFSAPSCGLITKTDAERLCNALLYGGAYP 
PPCKKSLAASLALGLELSERSVRVYHE\CFGKCKGL\LVPELYS 
SPSAAC1QCLD\CRLMYPPHKFWHSHKALENRTCHWGF\DSA\ 
NWRAYILLSODYTGKEEOARLGR\CLDDVKEKFDYGNKYKRRVP 
RVSSEPPAS1RPKTDDTSSQSPAPSEKDKPSSWLRTLAGSSNKS 
LGC VH PRQRLS AFRPWS P AV S AS EKELS PHLPAL I RDS FY S Y KS 

TP FT a V a PNTVZi T . H P D Ann VWQ C P T>C A A A\7Q ¥? AP P PT .ATfTOPT? Jf 

RKLTVDTPGAPETLAFVAAPEEDKPSEAEVEVESREEFTSSLSS 
LSSPS FTSS S SAKDLGS PGARALPSAVPDAAAPADAPSGLEAEb 
EHLROALEGGLDTKEAKEKFLHEWKMRVKQEEKLSAAbOAKRS 
LKQELSFLRVAKKEXUiEATEAKRNLRKEIERXRAENEKKMKEA 
NESRLRLKRELEQARQARVCDKGCEAGRLRAKYSAQIEDLOVKL 
QHAEADREOLRADLLREREAREHLEK\WK\ELQEQLWPRARPE 
AAGS EG V AAELE P 


6969 


1855 


118 


AGTMHGRLKVKTSEEQAEAKRLEREQKLKLYOSATQAVFQKRQA 
GELDES VLELTSQI bGAN P DFATLWNCRREVLOQLETQKSPEEL 
AALVKAEIXSFLESCLRVNPKSYGTWHHRCWLLGRLPEPNWTREL 
ELCARFLEVDERNFHCWDYRRFVATQAAVPPAEELAFTDSLITR 
NFSNYSSWHYRSCLLPOLKPQPDSGPOGRLPEDVLLKELELVON 
AFFTDPjWOSAWFYHRWLLGRADPQDALRCLHVS rdeacltvs F 
SRPLLVGSRME I LLLMVDDS PLI VEWRTPDGRNRPSHVWLCDLP 
AASLN DQLPQK7FRVI WTAGDVQKECVLLKG RQEGWCRDSTTDE 
QI>FRCELSVEKSTVLQSELESCKEIjQELEPENKWCL\LTI I LLM 
RALDPLLYEKETLQYFQTLK\AWDPKRATY\bDDLRSKFLLENS 
VLKMEYAE^VbHbAHKDbTVbCHLEQbbbVTHbDbSHNRbRTb 
PPAbAAbRCbEDPPPRT\VbOASDNAIESLDGVTNbPRbQEbbb 
CNNRbQOPAVLQPbASCPRbVbLNbQGNPLCQAVGIbEObAEbb 
PSVSSVbT 



6970


3 


152B 


S F' P P LLS S P S A VGEGKVA VAAPC PG R S ECARAXMA Y I QLE PLNE 
GFliSRISGLliLCRWTCRHCCQKCYESSCCOSSEDEVEILGPFPA 
OTP P WbKAS R S S DKDGDS VHTA S E V PLTP RTNS PDGRRS S S DTS 
KSTYSLTRRJSSLESRRPSSPLIDIKPIEFGVLSAKKEPIQPSV 
LP.k TYNl'DDYFRKFEPHhYSLuSNbUUvUbhl UbL I JjS K ryiXjM 
LII F S TQY DLLHNHLT VRV I EARDL P PP I S HDG S RQDMAHSNP Y V 
K j CLLPDQKflSKQTGVKRKTQKPVFEER YTFEI PFLEAQRRTLL 
LTWD FDKFS RHCVIG KVSVPLCEVDLVKGGHWWKALI PSSQNE 
VELGELLLSLNYLPSAGRLNVDVIRAKOLLQTDVSQGSDPFVKI 
GLVHGLKLVKTKKTSFLRGTIDPFYNESFSFKVPOEELENASLV 
FTV FGHNMKS SNDF1GR1VIG\QYS SGP\ SEPNHWRRMLNTHRT 
AVEQWHSLRSRAECDRVSPASLEVT 


6971 


37 


3702


ACFYVPGSRSFKLIPRHGLVNMGRSGKLPSGVSAKLKRWKKGHS 
SDSNPA1CRKRQAARSRFFSRPSGRSDLTVDAVKLHKELQSGSL 
RLGKSEAPETPHEEEAELVLTEKSSGTFLSGLSDCTNVTFSKVQ 
RFWESNSAAHKE I CAVLAAVTEV I RSQGC KETETEYFAALIRKA 
AQHGVCSVLKGSEFWFEKAPAHHPAAISTAKFCIQEIEKSGGSK 
EATTTLHKLTLLKDLLPCFPEGLVKS CS ETLLR VMTXSHVLVTA 
CAMQAFHSLFKARPGL£TLSAEI*NAQIITALYDYVPSENDLQPL 
LP. WLKVMEKAK 1 NLVRLQWDLGLGHL PRF FGTAVTCLLSPHSQV 
LTAATQSLKE 1 LKE C VAPHMAD I GS VTSSASGPAQS VAKMFRAV 
E EGLT Y KFHAA W SS VLQLLC VF FEACGROAH P VMR KCLQS LCDL 
R LS PHF PHTAJtfjDOAVGAA VTSMGPE VVLQAV PL E 1 DGSEETLD 
FFRSWLLPV3 RDHVQETRLGFFTTY ¥ LPLANTLK S KAMDLAQAG 
STVESKI YDTLQWQMWTLLPGFCTRPTDVAI SFKGLARTLGMAI 
S ERPDLRVTVCQALRTLITKGCQAEADRAEVSRFAKNFLP ILFN 
LYGQPVAAGDTPAPRRAVLETIRT YLT I TETQLVNSLLEKAS EK 
VLDPASSDFTRLSVLDbVVALAPCADEAAISiaYSTIRPYLESK 
AUG VQK KA YR VLEE VCAS PQGPGAL F VQSHLEDL KKTLLDS LRS 
TSS PAKR PKLKCLLHI VRKLSAEKKE FI TALI PE VI LCTKEVS V 
GAR KNA FALL VE MGHAFLR FGSNOEEALQCYLVL I Y PGLVGAVT 
MVSCS I LALTHLLFEFKGLMGTSTVEQLLENVCLLLASRTRDW 
KSALGFI KVAVT VMDVAHLAKHVOLVTJlEAl GKLS DDMRRHFRMK 
LRNLFT\ KF3 P K\ FGI LTV3GKKAVGPKEYHRVLVN I RKAEARAK 
RHRALSQAAVEEEEEEEEEEEPAQGKGDSIEEILADSEBEEDNE 
EE SRS RG KEQR KLARQRSRAWLKEGGGD EPLN FLDP KVAQRVLA 
TQFGPGRGRKKDHSFKVSADGRLIIREEADGNKMEEEEGAKGED 
SEKADPMEDV2 IRNKKHQKLKHQKEAEEEELEIPPQYQAGGSGI 
HRPVAKKAMPGAEYKAKKAKGPVKKKGRPDPYAYI PLNRS KLNR 
R KXWKLOGOFKGLVKAAQR GS QVGKKNR R KDRR P 


6972 


2175 


973 


?GG AI LLPLWRRTRPREATVPRGAAORGRARSAEGR I PSSQS PS 
PAEAGGATRSPPPRPPRPARPPGPSAPPLLRSDAGPGATVSAAA 
AAA ji ERARRGA.TMGAyLs I LGHMVJjr F V Wfr 1> Yoiil»Mi\Jjr UKo I r 
AI 7LES PDIKYPLRLIDREI ISHDTRRFRFALPSPQHILGLPVG 
OKI YLS AR I DGNLWRP YTP I S SDDD KG FVDLVI KVY FKDTHPK 
FPAGGKMS0YLESMOIGDTIEFRGPSGLLVY0GKGKFAIRPDXK 
SN PI 3 RTVKSVGM I AGGTGI TPMLQV I RAIMKDPDDHTVCHLLF 
ANQTE KD I LLR PE LEELKNKHS ARFKLW YTLDRAP E AWD YGQG\ 
FWEEMIRDKLPPPE\EEPLVLMCX5PPPMIQYACLPNL\DHVGH 
PTERCFVF 


6973 


1 


1964 


LQF RCAKRG LRAQKCGRPAPGVDAMVLC PVI GKLLHKRWLAS A 
S P R RQ E I LSN AGLR FE WPS KFKEKLDKA5 FATP YG YAMBTAKQ 
KALEVANRLYQKDLRAPDWIGADT2VTVGGLILEKPVDK0DAY 
RKLSRFE/SGREHSVFTGVAIVHCSSKDHO^DTRVSEFYEETKV 
KFS ELSEELLWEYVHSGEPMDKAGGYGI QALGGMLVES VHGDFL 
NWGFPLNH F CKQL VKLYYP PR PEDLRR S VKK DS I PAADTFEDL 



S DVSGGGS E PTQRDAGS RDE KAEAGE AGQATAEAHCHRTR ETLP 

KVDASACGMERLLDICAAMGLLEKTEQGYSKTETANVYLASDGE 
YSLHGFIMHNNDLTWNLFTYLEFAIREGTNOHHRALGKKAEDLF 
ODAYYOSPETRLRFMRAMKGMTKLTACOVATAFNLSRFSSACDV 
GGCTGALARELAREYPRMOVTVFDLPDIIELAAHFQPPGPQAVQ 
IKFAAGDFFRDPLPSAELYVLCRILKDWPDDKVHKLLSRVAESC 
KPGAGLLLVETLLDEEKRVAORALMOSLNMLVOTEGKERSLGEY 


6974 


306>: 


7.172 


RSCAAFASFASRPPLELFAPPGSHRSPPGRGVATSAQCALSVRK 
LLAARPGLGTKYQATMVYKTLFALCILTAGWRVQSLPTSAPLSV 
SLPTNIVPPTTIWTSSPONTDADTASFSNGTHNNSVIiPVTASAP 
TSLLPKN I S I ESREEEI TS PGSNWEGTNTDPS PSGFSSTSGGVH 
LTTTLEEHSLGTPEAGVAATLSQSAAEPPTLISPOAPASSPSSL 
STSPPEVFSASVTTNHS5TVTSTQPTGAPTAPESPTEESSSDHT 
PTSHATAEPVPQEKTP?TTVSGKVMCELIDMET\FPPFPG 


6975 


/. 


SOD


RPRPTVKCCKWALKLJETAMETLINVFHAHSGKSG^KYKLSKKEL
KELLOTELSGFLDVKELML+ATEALKTFEEA* KSP13QCSSSRS
SLPPAPQPPPYL*LSAVPFP1HLPLPLLPP0A0KDVDAVDKVMK
ELDENGDGEVDFQEYVVLVAAiTVACNNFFWENS


6976 


1216 


S7C 


GCQb*VAYGTTENSPVTFAHFPEDTVEOKAESVGRIMPHTEARI 
I^MEAGTLAKiNTPGELCIRGYCVMLGYWGEPOKTEEAVDODKW 
YWTGDVATMNEQGFCK I VGR SKDMI I RGGEN I Y PAELEDFFHTH 
PKVQEVOVVGVKDDRMGEEI CACIRLKDGEETTVEEl KAFCKGK 
J SHFKIPKYIVFVTNYPLTI SGKIQKFKLREQMERHLNL* IKQQ 
ACPGRIA 


6977 


J-£ it C 


see 


SLF1NTNLLSNQIRKTSFGMCSEPISDNTED0KGKLKTPDFA*R 
ANKKSKHHVNGNRTVEPrFEGTOMAVFGMGCFWGAERKFWVLKG 
VYS TO VG FAGG Y TSNPTY K E VCS EKTGHAEVV R WYQPEHMS FE 
ELLKVFWENHDPTOGMROGNDHGTQYRSAIYPTSAKQMEAALSS 
KEN Y QKVLSEHGFGP1 TTDI REGQTF YYAEDYHQOY L»S KNPNGY 
CGLGGTGVSCPVGIKK 


6978 


3 


242


SFPFRDSRRCGCCKGSSLRKTAVAMVKLSKEAKCRLOOLFKGSQ 
FAIRWGFI PLVI YLGFKRGADPGMPEPTVLSLLWG 


6979 


3917 


2j46 


DEAR^GEAVAAAILSRCRHWSGPPPFPPSPPDRKGLRGTEPWE 
AGPGSGATPGARAMDVRRLKVNELREELQRRGLDTRGLKTELAJS 
R LOAALE AEE PDDERELDADDEPGR PGH I NEE VET EGGS ELEGT 
AOPP PPGLOPHAEPGGYSG PDGH YAMDNITRQNQF YDTQVI KQE 
NE SG Y ERRPLEMEQQQAYR PEMXTEMXQGAPTS FLPPE ASQL»KP 
DRQQFQSRKP.PYEENRGRGYFEKREDRRGRSPQPPAEEDEDDFD 
DTLVAIDTYI^ CDLHFKVARDRSSGYPLTI EGFA YL WSGARAS YG 
VRRGRVCFEMKINEEISVKHIiPSTEPDPHWRIGWSIjDSCSTQL 
GEEP FS YGYGGTGKKSTNSRFENYGDKFAENDV 1 G C FADFECGN 
DVELS FTKNGX WMGI AFR I Q KEALGGOALYPHVL VKMCAVE FNF 
G QRAE P Y CS VLPG FTF I QHLPLS ER I RGTVG PK S KAECE I LMMV 
G L PAAGKTTWA I KHAASN PS KKYNI LGTNAI MDKMR VMGLRRQR 
NYAGRWDVL1 QQATQCLNRL,! Q1AARKKRNYI LDOTflVYGSAQR 
R KMR P FEGFQRKAI VI CPTDEDLKDRT I KRTDEEGXDVPDHAVI* 
EMKANFTbPDVGDFLDEVLFIELOREEADKLVRQYNEEGRKAGP 
PPEKRFDNRGGGGFRGRGGGGGFQRYENRGPFGGNRGGFQNRGG 
GSGGGGNYRGGFNRSGGGGYSONRWGNNNRDNNNSNNRGSYNRA 
PQQOPPPQQPPPPQPPPQQPPPPPSYSPARNPPGASTYKKNSNI 
PGSSANTSTPTVSSYSPPQSFGFFPSTFQPSYSOPPYNQGGYSQ 
GYTAPPPPPPPPPAYNYGSYGGYNPAPYTPPPPP7AQTYPQPSY 
N0YCOYA0QWNQYYQNOGQWPPYYGNYDYGSYSGNTQGGTSTQ 


6980 


1 


420 


GTRGRKTGRVAAPSTRRRTGNMOKLQTRSPAMSLSDPGLGYHPT 



CWTLRWPPLCSbHALHVFHCLFSSRLGTPVSPRLAMDPNCSCEA 
GGSCACAGSCKCKKCKCTSCKKSCCSCCPLGCAKCAQGCICKGA 
SEKCSCCA 


6981


10 


1054


PGRGFRRASLRPAFAARGVFQGGLGQAKQARTRACAALPTPHPS 
APRLLEPQGVFSLFPPPPGPWPNMILTKAOYDEIAQCLVSVPPT 
RQSLRKLKQRFPSQSQATLLS I FSQEYCKHI KRTHAKHHTSEAl 
ES YYCRYLNGWKNGAAPVLLDliANEVDYAPSLMARLI LERFLQ 
EHEETPPSKSI INSMLRDPSOI PDGVLANQVYOCIVNDCCYGPL 
VDCIKHAIGHEHEVLLRDLLLEKNLSFLDEDOLRAKGYDKTPDF 
I LOV PVAVEGHI I HW I ES KAS FGDECSKHAYLHDQFWS YWNRFG 
PGLVIYWYGFIOELDCNRERGILLKACFPTNIVTLCHSIA 


6982


153 


1285 


FPQQDCSAPAAPG LAG SEPRRLRAYRRRRQRARGLKRVAW LAPP 
PSLLQGliQGWAOAPVDGTLGPEDSRASSPMIQNSRPSLLOPQDV 
GDTVETLMLHPVIKAFLCGSISGTCSTLLFOPLDLLKTRLQTLO 
PSDHGSRRVGMLAVLLKWRTESLLGLWKGMSPSIVRCVPGVGI 
YFGTLYSLKQYFLRGHPPTALESVMLGVGSRSVAGVCMSPITVI 
KTR Y ESG KYG Y ES I Y AALR S 1 Y HS EGH RGLFS GLTATLLRDAP F 
SG I YLMFYNQT KNI VPHDQVDATLI P 1 TNFS CG I FAG I LASLVT 
0 P ADV I KTHMQLYPL K FQW 3 G 0AVTL1 F KD YG liRGFFQGG I PRA 
LRRTLMAAMAWTVYEEMMAKKGLKS 


6983 


82 


773 


EMS FLQDPS FFTMGMWS I GAGALGAAALALLLANTDVFLS KPQK 
AALEYLEDIDLKTLEKEPRTFKAKELWEKN(5AVIMAVRRPGCFL 
CREEAADLSSLKSKLDQLGVFLYAWKEHIRTEVKDFOPYFKGE 
IFLDEKKKFYGPORRKMMFMGFIRLGVWYNFFRAWNGGFSGNLE 
GEG FILGG VP WGSGKQG I LLEHREKEFGDKVNLLSVLEAAKM I 
KPQT1ASEKK 


6984 


1845 


1282 


GGRSAYSLPAGSLPRVPATAAAKMASGVQVADEVCRIFYDMKVR 
KCSTPEEIKKRKKAVIFCLSADKKCI I VEEGKEILVGDVGVTI T 
D P F KH FVGMLPEKDCR YALYDAS FETK E S RKEELMFFLWAPELA 
PLKS KMI YASS KDAI KKKFQG I KHECQANGPEDLNRACI AEKLG 
GSh 2 VAFEGCPV 


6985 


1887 


1324 


RRTAGIYPCFPKPGRTRHALCSWLLLLTGQLAFDDFCESCAMM 
WOKYAGSRRSMPLGARILFHGVFYAGGFAIVYYLIQKFHSRALY 
YKLAVEQL0SHPEA0EA1>GPPLiNIHYLKLIDRENFVDIVDAKLK 
i pvsgsksegllyvhssrggpfqrwhldevflelkix30qi p vfk 
LSGENGDEVKKE 


6986 


642 


1350 


YHLYFKMGDPNSRKKQALNRLRAOLRKKKESLADQFDFKMYIAF 
VFKEKKXKSALFE VS EVI P VMTNNY E EN I LKG VRDS S YS LE S SL 
ELLOKDWQLHAPRYOSMRRDVIGCTOEMPFILWPRNDIEKIVC 
LLFSR W KESDEPFR P VQAX F E FHHGD YE KQ FLHVLS R KD KTG I V 
VNNPNQSVFLFl DRQHLQTPKNKATI FKLCS I CLYLPQEQLTHW 
AVGTI EDHLRPYMPE 


6987 


1623 


341 


LEAAEKASRAFKESORQTDS KNYETEN WS POKSQRRYDMYNTAC 
FLGE 1 E VGLYTI 0 1 LQLTPF FHKENELS KKHMVQFLSGKWTI PP 

nnnMrr'VT t\t £?VE"PCLrT itmt /ipr\T t/D/™Tr\TTt?T nviun tt t ia/uvta 
urliribtK. I LiHiXiSm I bHJbKWiiySDLiKKv.r Dr v lDxnvhLKriRiTQ 

KEIAEIMLSKKVSRCFRKYTELFCHLDPCbLQSKESQLLQEENC 

RKKLEALRADRFAGLLEYLNPNYKDATTMESIVNEYAFLLQQNS 

KKP>frWEKQNSILANIILSCLKPNSKLIO?LTTLKKQl»REVLQF 

VGLSHQ YPGP YFLACLLFW P ENQELDQDS KLI EKYVS S LNRS FR 

GQYKRMCRSKQASTI.FYLGKRKGLNS1VHKAKIE0YFDKA0NTN 

SLWHSGDVWKKNE VKDLLRR LTGQAEGKLI S VEYGTEEKIKI P V 

ISVYSGPLRSGRNIERVSFYLGFSIEGPPGL 


6988 


3 


689 


TOLLRRPAVFVGSAASGIRSGLWSASSGHWCAPAAGRAHAPVPR 
LVRGLGAASTAAPQDAQTGP0PMPRADC1MRHLPYFCRGQWRG 
FGRGSKQI/5I PTANFPEQWDNLPADl STGI YYGWAS VGSGDVH 
KM WS IGWNPYYKNTKKSMETH I MHTFKEDFYGEILRVAI VGYL 



RPEKNFDSLESLlSAIOGDa EEAKKRLELPEHLK1 KEDNFFOVS 
KSKIMNGH 


6989 


2 


me 


LMPSDRPLSPSTHASAGSKCHAPPTTARRAFPIPFGSKSNMATL 
KDQLIYNLLKEEQTPONKJTWGVGAVGKACAISILMKDLADEL 
A L VDV I E D KL KG EMMDLQHG S L FLRTP K I VS G KD YNVTANS KL V 
IITAGAROQEGESRLNLVOKNVNIFKFIIPNWKySPNCKLLlV 
S N P VD I L7 Y VAW KISGFPKNRYI G SGCNLDS ARFRY LMGER LG V 
HPLSCHGWLGEKGDSSVPWSGf4NVAGVSLKTLKPDLGTDKI)K 
EQWKEVHKQWESAYEVIKL-KGYTSWAIGLSVADLAESIMKNLR 
R VHP VS TM I KGLYG I KDDVFLS VPC I LG0NG I S DLVXVTLTS EE 
EARLKKSADTLWG I QXELQT 


6990 


719 


258 


THASGKAS WLALRTRTAVTS LLS FTPATALAVRYAS K KSGGS S 
KNLGGKSSGRROG IKKMEGHYVHAGNI I ATQRHFRWHPGAHVGV 
GKNKCLYALEEGIVRYTKEVYVPHPRNTEAVDLITRLPKGAVLY 
KTFVHWPAKPEGTFKLVAML 


6991 


169 


453 


RRSSDFHNPGFLSRPVSLREN3HHQV1CSTKNKRRNPKKIAYLL 
SSLLMTNLNPNESTENQPVBAYWAFTLDQEFLTYACVEGTGCLF 
CGRHVH 


6992 




510 


RQAPGCSSLALRCVRQVYCGLVRAPQVQTRPLSSRFVERRGALY 
RSPMNQENPPPYPGPGPTAPYPPYPPQPMGPGPMGGPYPPPCGY 
PYQGYPQYGWQGGPQEPPKTTVYWEDORRDELGPSTCLTACWT 
ALCCCCLWDMLT 


6993


a 


374 


OWCVTCPOHNARCGPAVPPGIQAYGAAPFEDLOVDFTEMSKCRG 
DRVWIKNWNVASLCPLWKGPCTWLSPPTAVKVEG1PAWIHHSH 
VK PAAR ET WEAR PS PDNP FR VTLK KTTS PAP VTPGS 


6994 


346 


HOC 


QW PEKDP VI4AASS I S S P WG Kj-IVFKAI LMVLVAL r LLKSAIiACS R 
RDFAPPGQQKREAPVDVLTQa GRS VRGTLDAWI GPETMHLVSES 
SS0VLWAI SSAI SVAFFALSG I AAQLLNALGLAGDYLAQGLKLS 
PGOVQTFLLWGAGALWYWLLSLLLGLVLALLGRI LWGLKLV3 F 
LAGFVALMRS V PDPSTRALLLLALLI LYALLS RLTGSRASGAOL 
EAKVRGLERQVEELRWRORRAAKGARSVEEE 


6995 


144 


1346 


GS VAVGLSG I MAAQKDLWDAI V J G AG I QGCFTAYHLAKHRKR I L 
LLEQFFLPHS RGS SHGQSR 1 1 RKA YLEDF YTRMMHECYQ I WACL 
EHEAGTOLHROTGLLLLGMKENOELKTIOANLSRORVEHOCI,^ 
EELKQRFPNIRLPRGEVGLLDNSGGVIYAYKALRALQDAIROLG 
GIVRDGEKWEINPGLLVTVKTTSRSYQAKSliVITAGPWTNOLL 
RPLGIEMPLQTLRINVCYWREMVPGSYGVSQAFPCFLWLGLCPH 
HIYGLPTGEYPGLMKVSYHHGNHADPEERDCPTARTDIGDVQ1L 
SSFVRDHLPDLKPEPAV1ESCMYTNTPDEQF1LDRHPKYDNIV3 
GAG FSGHG F KLA P WG K I L YE LSM XLTPS YDLAP FR I S R FPS LG 
KA11L 


6996 


542 


1942 


etanae aaar ks amdw kevlrrrbatpntcpnkkkseqelkdee 
mdlftkyysewkggrkntnefykt3prfyyrlpaenevllqklr 
ees ravflqr ks relldnee lqnlw flldkhqtppm 1gee am i n 
yenflkvgekagakckqfftakvfakllhtdsygri s imqffny 
vmrkvwlhqtr i glslydvagogylresdleny i leliptlpql 
dgleksfysfyvctavrkffffldplrtgkikiqdiiacsfldd 
llelr3eelskesqetnwfsapsalrvygqylnldkdhngmlsk 
eelsrygtatmtm^fldrvfoecltydgemdyktyldfvlalen 
rkepaaloyifklldienkgylnvfslwyffraiqelmkihgod 

P VS FQD VKDE 1 FDM VKP KD P L K IS LQDLINSNQG DT VTT I L I DL 
NGFWTYENREALVANDSENSADLDDT 


6997 


370 


1104 


AMELTIF1LRLAIYILTFPLYLLNFLGLWSWICKKWFPYFLVRF 
TV I YNEQMAS KKRELFSNLQE FAG PS GKLSLLEVGCGTGAN FKF 
YPPGCRVTCI DPN PNFEKFLI KS I AENRHIjOFERFWAAGENMH 



QVADGSVDWVCTLVLCSVKNQERIljREVCRVLRPGGAFyFMEH 
VAAE CSTWNY FW0OVLDPAWHLLFDGCNLTRE S W KALERASFSK 
LKLOH I QAPLS WELVRPH I YGYAVK 


6998 


2 


616 


FVSRAuLRVRSRRHPAEERAAPGRPEDAPIECPGATNCPEPLWC 
SHLPVPYAPPTMESRGKS AS SPKPDTKVPQVTTEAKVP PAADGK 
APLTKPSKKEAPAEKOQPPAAPTTAPAKKTSAKADPALLI'JIIHSN 
LKPAPTVPSSPDATPEPXGPGDGAEEDEAASGGPGGRGPWSCEN 
FNPLLVAGGVAVAAI ALI LGVAFLVRKK 


6999 


14 


1591 


GRAGACSRRDTAI-lSIElESSDVIRLIMOYbKENSLHRAlATLQE 
ETTVSIiNTVDSIESFVADINSGHWDTVLQAIOSLXLPDKTLlDL 
YEOWLELIELRELGAARSLLRQTDPMIMLKQTOPERYIHLENI, 
LARS Y FDPREAY PDGSS KEKRRAAI AQALAGEVS WPF SRLMAL 
LGQALKWQOHQGLLPPGMTI DLFRGKAAVKDV EE EKPPTQ LSRH 
I K FGQ KSHVECARFSPDGQ YLVTGSVDGFI EVVW FTTG KI RKDL 
KYQAQDNFMMMDDAVliCMCFSRDTEMLATGAODG KI KVWK IQSG 
□CLRR FERAHS KGVTCLS FS KDSSQI LS AS FDQTIR IHGL KSGK 
TLKEFRGHSSFVNEATFTQDGHY11SASSDGTVKIWNMKTTECS 
NTFKS LGSTAGTDI TVNS VI LLPKNPEHFWCNR SNTWI MNMQ 
GQ3VRSFSSGKREGGDFVCCALSPRGEWIYCVGEDFVLYCFSTV 
TGKLERTLTVKEKDVI G I AHHPHQNLI ATYSEDG LLKLWKP 


7000 


2 


827


GPGWFLELKESEGPPESERSEFFSQREEENEEEEAQEPEETGP 
KNPLLQPALTGDVEGLCKI FEDPENPHHEQAMOLLLEEDI VGRN 
LLYAACMAGOSDVI RALAKYGVNLNE KTTRGY TLLH CAAAWGRL 
ETLKALVELDVD I EALNFRE ERARDVAAR Y SQTECV EFLDWADA 
RLTLKKYIAKVSIAVTDTEKGSGKLLKEDKNTILSACRAKI-IEWL 
ETHTEASINELFE0RQ0LEDIVTPIF7KMTTPC0VKSAKSVTSH 
DQKRSODDTSN 


7001 


2056 


844 


RRCL 1 1 AFLKGC FIFIYFIFI FETEFLSCCPG WS AVAQSR 11 AN 
FASQVQAI FI LPKDSQVGPD VKSEAAPKRAI.YES VFGSC-E I CGP 
TSPKRLCI RPSEPVDAWWSVKHDPLPI.LPE ANGHRS TM S PTI 
VSFA3VSPT0DSRPNMSRPLITRSPASPLNN0GIPTPAOLTKSN 
APVHIDVGGHMYTSSLATLTKYPESRIGRLFDGTEPIVLDSLKQ 
H Y F 3 DRDGQM FR Y I LNFLRTSKLL I P DD FKDYTLL Y EE AK Y FQL 
QPKLLEKERWKQDRETGRFSRPCECLWRVAPDLGERITLSGDK 
SLIEEVFPEIGDVI1CNSVNAGWNHDSTHVIRFPLNGYCHLNSVQ 
VLERLQQRGF E 1 VG SCGGGVDS SQFS E YVLRR ELRRTPR V P S V I 
RIKQEPLD 


7002 


2043 


498


PMPSSTRWTTS*TYTDTSSAWACRPTTGTCT*TAAPGPTVRWWP 
TPCSRHQSRRRLTCWCSTSRPCGR*GGLCVRTAPTRPTTSASSS 
SWTSAGTSWPAGRRTGTATSGTATTTSVWPGCGTRMWSTOWSSV 
PRSRSCCSRPATTPPSKPGAPHAPCASSRHLAHGLAPSSPGLPA 
RGAEVC 


7003 


818 


61 


QGRFRAFCWQRDFLQPPGMRLSALLALASKVTLPPHYRYGMSPP 
GS VADKRKNP P W I RRRPVWEPIS DEDW YLFCGDTVE I LEGKDA 
GKOGKWQVIRORNWVWGGLNTHYRYIGKTMCYRGTMIPSEAP 
Lli^ROVKLVDPMDRKPTEIEWRFTEAGERVRVSTRSGRI I PKPE 
FPRADG I VPETW I DGPKDTS VEDALERTY VPCLKTLQEEVWEAM 
GI KETR \NTRRS IG IS PGAEQLLPNFCPSbEG 


7004 


121 


2285 


FLLPVLTSRSLRQPAVPHARLGGVEPAAMKSARAKTPRKPTVKK 
G\PKRTLKTOLG/YYCRVRPLGFPDQECCIEVINNTTVQL«TPE 
GYRLNRNGDYKETQYS FKQV FGTHTTQKELFDVVAN PLVNDL I H 
GKNGLLFTYGVTGSGKTHTMTGSPGEGGLLPRCLDMI ?NS 1GSF 
QAKRYVFKSNDRNSMDIQCEVDALLERQKREAMPNPKTSSSKRQ 
VDPE FADM I TVOEFCKAEE VDEDS VYGVFVS Y I EI YNNYI YDLL 
EEVPFDPINPNLHNLNCFVKIKNHNMYVAGCTEVEVKSTEEAFE 
VFWRGQKKRRIANTHLNRESSRSHSVFNIKLVOAPLDADGDNVL 



QEKEQITI SQLSLVDLAGSERTNRTRAEGNRLREAGN I NQSLMT 
LRTCMD\OjREN0MYGTNKMVPYRDSKLTHLFKNYFDGEGKYRMI 
VCVNPKAEDYEENLOVr/IRFAEVTOEVEVARPVDKAICGLTPGRR 

yrnqprgp\igneplvtdwlqsfpplpsceildindeotlprl 
I ealekrhnlrqmmi defnkqsnafkallcefdnavlskenhmq 
gklnekekmisgokleierlekknktleykieilektttiyeed 
krnlqoeletqnoklorofsdkrrlearlogmvtettmkwekec 
errvaakolemonklwvkdeklkqlkaivtepktekperpsrer 
drekvtqrsvspspvpvsyl 


7005 


63 


876 


RNMALYQRWRCLRLOGLQACRLHTAWSTPPRWLAERLGLFEEii 
V7AAQVKRLASMA0KEPRTI KI SLPGGQK1 DAVAWNTT P YQLARQ 
ISS TLADTAVAAOVNGE PYDLER PLETDSDLR FLTFDS PEGKAV 
F WHSSTH VLG AAAEQ FIjGAVLCRG P STE YG FYH DF FLG KERT 1 R 
G S ELP VLER I COELTAAAR P FRRLEAS RDQLRQ LFKDN P FKLH h 
I E E KVTG PTATVYG CGTLVDLCOGPHLRHTGO I GG LKLLSNS S S 


7006 


22 


898 


NAFGRKSTAVKMAAAAWLQVLPVILLLLGAHPSPLSFFSAGPAT 
VAAADRS KWH I PI PSGKNYFSFGKI LFRNTTI FLKFDG EPCDLS 
LN I TWYLKS ADC YNE I YNFKAEEVELYLEKLKEKRGLSGKYQTS 
SKLFQKCSELFKTQTFSGDFMHRLPIJLiGEKQEAKENGTNLTFIG 
DKTAMKEPLQTWQDAPYIFIVHIGISSSKESSKENSLSNIiFTMT 
VEVKGPYEYLTLEDYPLMIFFMVMCIVYVLFGVLWLAWSACYWR 
DLLRIQFWIGAV1 FLGMLEKAVFYAGFQ 


7007 


2 


i nm 

1UU1 


2i MT\7 VnTV K* DT5 D ft ^ V(Z Zi Q Q wp r*T .P V FfVf J P T . V KPHH VCO. 21 T Ji A 
>U*i L V Ouru 1 c Cm ir T\. if j\ X * v^Z-iO o V E*V^ urtrvxAJA sj ur i\(.uu - wwvt 

YTQAI^LDATPODQAVLHPJ^RAACHUCLEDYDKAETEAS KAI EK 
DGGDVKALYRRSOAiEKLGRLDQAVLDLORCVSLEPKNKVFOEA 
LRNIGGQIQEKVRYMSSTDAKVEQMFQILLDPEEKGTEKKQKAS 
QNLWLAR EDAGAE K I FRSNGVGLLQRLLDMGETDLMLAALRTL 
VG I CSEHQSRTVATLS I LGTRR WS ILGVE S QAVSLAACHLLQV 
MFDALKEGVKKGFRGKEGAIIVGEWKQVWGLLDVTVMEGMGLSQ 
PGQFFGDQTCSCRLFGI R FGDI ILL 


7008 


70 


1478 


CRSALGHERPPPAHLPAGGRRLQTCPRSCRWLGRPPSGL.PPGPR 
S P P PLAG PGQKMVQKKPAELQG FHRS FKGQN PFELAFSLDQPDH 
GDSDFGLQCS AR P DM PAS QP I D I PDAKKRG KK KKRGRATDS FSG 
PFFnwni.nPDVI ,nFf5ZxHZXl?Vr>Tf , TNlj3T < ?OEYAVKI I EKOPGH 
IRSRVFREVEMLYQCQGHRNVLELIEFFEEEDRFYLVFEKMRGG 
S I LSH I H KR RH FNE LE AS WVQDVAS ALD F LHN KG I AK RDLKPE 
N I LCEHPNQVS P VKI CDFDLGSG I KLNGDCS P I STPELLTPCGS 
AE YMAPEWEAFSEEAS I YDKRCDLWSLG V I LYILLSG YPPFVG 
RCGSDCGWDRGEACPACQNMLFESIQEGKYEFPDKDVIAHISCAA 
KDLIS KLLVRDAKQRLS AAQVLQHPWVQGCAPENTL PTPMVLQR 
WDSHFLLP PHPCRI HVRPGGLVRTVTVNE 


7009 


1 


626 


ARQLRNSWVDDr VAAPLI PLSQQI PTGNSLYESYYKQVDPAYTG 
RVGASEAALFLKKSGLSDI ILGKI WDLADPEGKGFLDKQGFYVA 
LR LVACAOS GH E VTLSNLNLSMP P P KFHDTS S P LMVT P PS AEAH 
WAVRVEEKAKFDGIFESLLPINGLLSGDKVKPVLMNSKLPLDVL 
GRVWDLSD I DKDGHLVRDEFAVAMHLVYRALE 


7010 


79 


571 


SHTRRAWPETLLSPLCPLLGGGTAMSGGEQKPERYYVGVDVGT 
GSVRAALVDQSGVLLAFADQPIKNWEPQFNHHEQSSEDIWAACC 
WTKKWOGIDLNQIRGLGFDATCSLWLDKQFHPLPVNQEGDS 
HRWIMWLDHRAVSQVNR INETKHS VLQYVGG 


7011 


3 


994 


R 1 QTLPNQNQ S QTQ PLLXTP PAVLQPI APQTT FGVQTQ PQPQS L 
LOAQISAASITPLLQTQPQPLLQQPQQKAGLLQPPVRIVSQPQP 
ARRLDPPSRFSGRNDRGDQVPNRKDDRSRERERERRRSRERSPQ 
RKRSRERSPRRERERSPRRVRRWPRYTVQFSKFSLDCPSCDMM 
ELRRR YQNL Y IPS DFFDAQFTWVDAFPLS R P FQLGN Y CNF YVMH 
REVESLE KNMAI LD P PDADHL Y S AKVMLMAS ? S MEDLYH KSCAL 
AEDPQELR DG FQKPARLVKFLVGMKGKDE AMA 1 GGH W S PSLDGP 
DPEKDPSVLIKT\AIRCCKALTG 


7012 


1 


2661 


RRAGSVKRGEARLFGPTERQSERPLRPSAARRPEMLSGXKAAAA 
AAAAAAAATGTEAG PGTAGGS ENG S EVAAQ P AG LS G P AE VG PGA 
VGERTPRKKEPPRASPPGGLAEPPGSAGPQAGPTWPGSATPME 
TGIAETPEG\RRTSRRKRAKVEYREMDESLANLSEDEYYSEEER 
NAKAEKEKKLPPPPPQAPPEEENESEPEEPSGVEGAAFQSRLPH 
DRMTS0EAACFPDI1SGPQGTQKVFLF3RKRTL0LWLDNPKIQL 
TFEATLQQLEAP YNSDTVLVKRVHSYLERHGLINFG I YKRIKPL 
PTKKTGKVI 1 1GSGVSGLAAARQLQSFGMDVTLLEARDRVGGRV 
A7FRKGNY VADLGAMWTGLGGNPMAWS KQVNMELAK I KQKCP 
LYEANGQAVPKEKDEMVEOEFNRLIiEATSYLSHQLDFNVLNNKP 
VSLGOALEWIQLQEKHVKDEQIEHWKKIVKTOEELKEIjLNKKV 
NLKEKIKELHQQYKEASEVKPPRDITAEFLVKSKHRDLTALCKE 
YDELAETOGKLEEKLQELEANP PSDVYLS SRDRQ I LDWH FANLE 
F AN ATPLS TLS LKH WDQ DDDFE FTGS HLTVRNG YSC V P V ALAEG 
LDI KLNTAVRQVRYTASGCEVIAVNTRSTSQTFI YKCDAVLCTL 
PLGVLKQOPPAVOFVPPLPEWKTSAVQRKGFGNLNKWLCFDRV 
FWD P S VNL FGHVGS TTASRGELFLFWNLY KAP I LLALVAGE AAG 
IMENI SDDVI VGRCLA1 LKGI FGSSAVPQPKETWSRWRADPWA 
RGSYSYVAAGSSGNDYDLMAQPITPGPSIPGAPQPIPRLFFAGE 
HTIRNYPATVHGALLSGLREAGRIADQFLGAMYTLPRQATPGVP 
AQQSPSN 


7013 


1 


2661 


RRAGSVKRGEARLFGPTERQSERPLRPSAARRPEMLSGKKAAAA 
AAAAAAAATGTEAGPGTAGGSENGSEVAAQPAGLSGPAEVGPGA 
VGERTPRKKEPPRASPPGGLAEPPGSAGPOAGPTWPGSATPME 
TG1A£TPEG\RRTSRRKRAKVEYREMDESLANLSEDEYYSEEER 
NAKAE KEKKLP P P P PQAP PEEENES E PEE P S G VEG AAFQS RLPH 
DRMTSQEAACFPDIISGPQQTQKVFLFIRNRTLOLWLDNPKIQL 
TFEATLQQLE AP YNSDTVLVHRVHS YLERHGL INFG1 YKRIKPL 
PTKKTGKVI 1 1 GSGVSGLAAARQLQS FGMDVTLLEARDR VGGRV 
ATFRKGNY VADLGAMWTGLGGNPMAWS KQVNMELAK I KQKCP 
LYEANGQAVPKEKDEMVEQEFNRLLEATSYLSHQLDFNVLNNKP 
VSLGQALEWIQLQEKHVKDEQIEHWKKIVKTQEELKELLNKMV 
NLKEKI KELHQQYKEASEVKPPRDITAEFLVKS KHRDLTALCKE 
YDELAETQGKLEEKLOELEANPPSDVYLSSRDRQILDWHFANLE 
FANATPLS TLS LKH WDQDDDFEFTGSHLTVRNG Y S CVP VALAEG 
LDI KLNTAVRQVRYTASGCE VIAVKTRSTSQTF I YKCDAVLCTL 
PLGVLKQQPPAVQFVPPLPEWKTSAVQRMGFGNLNKWLCFDRV 
FWDPS VNLFGHVGSTTASRGELFLFWNLYKAP I LLALV AGEAAG 
IMENI SDDVI VGRCLAI LKGI FGSSAVPQPKETWSRWRADPWA 
RGS YS YVAAGS SGNDYDLMAQPITPGPS I PGAPQP I PRLFFAGE 
HTI RN YPATVHG ALLSGLREAGRIADQ FLGAM YTLP RQATPG VP 
AQQSPSM 


7014 


3 


3950 


DFE VGDK I R I LATLEDG WLEGS LKGRTGI F P Y R F V KLC PDTR VE 
ETMALPQEGSLAR I P ETSLDCLENTLG VEEQRHETS DHEAEE PD 
CI I SEAPTSPLGHLTSEYDTDRNSYQDEDTAGG PPRS PGVEWEM 
PLATDSPTSDPTEWNGISSQPQVPFHPNLQKSQYYSTVGGSHP 
HSEQYPDLLPLEARTRDYASLPPKRMYSQLKTLQKPVLPLYRGS 
S VS AS RW KPRQS S PQLHNLAS YTKKHHTS S VYS I S ERLEMKPG 
PQAQGLVMEAATHSQGDGSTDLDSKLTQQLIEFEKSLAGPGTEP 
DKILRHFSIMDFNSEKDIVRGSSKLITEQELPERRKALRPPPPR 
PCTPVSTSPHLLVDQNLKPAPPLWRPSRPAPLPPSAQQRTNAV 
SPKLLSRHRPTCETLEKEGPGHMGRSLDQTSPCPLVLVRIEEME 
RDLDMYSRAQEELNLMLEEKQDESSRAETLEDLKFCESNIESLN 
MEi>Q0LREKTLLSSOSSSLVAPSGSVSAENP50R^LEKRAKVIE 

ELLQTERDY I RDLEMCI ERIMVPMQQAQVPN I DFEGLFGNMQMV 

IKVSKOLLAALEI SDAVGPVFLGHRDELEGTY KI YCQNHDEAI A 

LLEIYEKDEKIOKHLODSLADLKSLYNEWGCTWyiNLGSFLIKP 

VQRVMRY PLLLMELLNSTPESKPDKVPLTNAVLAVKE 1NVN IKE 

YKRRKDLVLKYRKGDEDSLMEKISKLNIHSI I KKSNRVSSHLKH 

LTGFAPQIKDEVFEETEKNFRMQERLIKSPIRDLSLYLQHIRES 

ACVKWAAVSMWDVCMERGHRDLEQFERVHRY I SDQLFTNFKER 

TERLVI SPLNQLLSMFTGPHKLVOKRFDKLLDF YUCTE RAEKLK 

DKKTLEEL0SARNNYEALWAQLLDELPKFHOYAOGLFTNCVHGY 

AF.AHrnFVKO&T.FOT.KPl ,T.^T.T,KVkGRFfiNl,TATFHFPW^f5VL,n 
tv-inns^ur vri^/\ui-«v/i- , i\ uuo uun v .numsv?!* u — -rti r ncc.nonv uy 

QLQVFTFFPESLPATKKPPERKT1DRQSARKPI»LGLPSYMLQSE 
ELRAS LLARYP PEKLFQAERNFNJiAQDLDVS LLEGDLVG VI KKK 
DPMGS0NRWL1DNGVTKGFVYSSFLKPYNPRKSHSDASVGSHSS 
TESEHGSSSPRFPRQKSGSTLTFNPN\S\MAVSFTSGSCQKQPQ 
DASPPPKEWDQGTLSASLNPSNSESSPSRCPSDPDSTSQPRSGD 
SADVARDVK0P7ATPRSYRNFRHFEIVGYSVPGRNGQSODLVKG 
CARTAOAPEDRSTE PDGSEAEGKQVYFAVYTF KARN PNELS VSA 
NQKLKI LEFKDVTGNTE WWIAEVNGKKGYVP SN Y I RKTE YT 


7015 


1842 


513 


RQAWHE \ VAAP S WRGARLVQS VLR V\-f QVGPHVARERV1 p FSSLL 
GFORRCVSCVAGSAFSGPRLASASRSNGQGSALDHFLGFS0PDS 

KSTLSNQLLGR KVFP VSRKVHTTRCQALGVI TEKETOVI LLDTP 
GI1SPGKQKRHHLSLSLLEDPWKSMESADLVWLVDVSDKWTRN 
QLS PQLLRCLTK YSQ I PS VLVMNKVDCLKQ K S V hh ELTAALTEG 
WNGKKLKMROAFHSHPGTHCPSPAVKDPNTOSVGNPQRIGWPH 
FKE3FMLSALS0EDVKTLKQYLLTQAQPGPWEYHSAVLTSQTPE 
EICANI IREKLLEHLPOEVPYNVQQKTAVWEEG PGGELVIQQKL 
LVPKESYVKLLIGPKGHVISQIAQEAGHDLMDIFLCDVDIRLSV 
KLLK 


7016 


167 


2513 


IbNAP KPP PPRDS VEAVAAKRDTGGGS WGTGMDVSGQETDWRST 
AFROKLVSOIEDAMRKAGVAHSKSSKDMESHVFLKAKTRDEYLS 
LVARLI I HFRD I HNKKSQASVSDPMNALQS LTGGP AAGAAG I GM 
PPRGPGOSLGGMGSLGAMGQPMSLSGQPPPGTSGMAPHSMAWS 
TATFOTOLOLOOVAAAAAAATARSSSSSSRRKySSSSSSSNSKQ 
FQAQQS7MQQ\QFQh\ WQQQQQL\QQQQQQQQHL I KLHHQNQQ 
QICKXX?QQLQRIAQLQLQO^C^(^G^C^G^CQALQAQPPIQQP 
PMQQPOPPPSQALPOQbQOMHHTQHHOPPPOPOOPPVAQNQPSQ 
LPPOSQTQPLVSQAQALPGQMLYTQPPIjKFVRAPMWQQPPVQP 
qvqqqqtavqtaoaaqmvapgvqvsqs s l pm ls s p s pgqqvqtp 
OSMPPPPOPSPOPGOPSSQPNSNVSSGPAPSPSSFLPSPSPQPF 
\QS PVTARTPQN FS VPS PGPLNTPVNP S S VMS PAG S SQAEEQQ Y 
LDKLKQLSKYIEPLRRMINKIDKNEDRKKDLSKMKSLLDILTDP 
SKRCPLKTLQKCEIALEKLKNDMAVPTPPPPPVPPTKQOYLCOP 
LLDA VLAN I RSP VFNHSLYRTFVPAMTA I HG P P 1 TAP WCTRKR 
RLEDDEROS IPSVLOGEVARLDPKFLVNLDPSHCSNNGTVHLl C 
KLDDKDLPSVPPLELSVPADYPAOSPLWIDRQWOYDANPFLOSV 
HRCMTSRLLQLPDKHSVTALLNTWAQSVHOACLSAA 


7017 


1 


1785 


:nlgntcymnsvi*alfmatdfrrqvlslnlngcnslmkklohl 
fafuuitqreayapr i ffeasrppwftprsqcdcseylrflldr 
lheeekilkvoashkpseilecsetslqevaskaavltetprts 
dgektliekmfggklrth1rclncrstsqkaeaftdlslafwps 
ysleymscpdcscspsiqdgglmqasvpgpseepwywpttaaf 
1cdslvnektigsppnefycsentsvpnesnkilvnkdvpqkpg 

GETTPSVTDLLNYFLAPEILTGDNOYYCENCASLOKAEKTMQIT 
EEPEYLI LTLLR FS YDQK YHVRRKI LDNVSLPL VLELP VKRITS 
FSSLSESVJSVBVDFTDLSENLAKKLKPSGTDEASCTKLVPYLLS 
SVWHSGISSESGHYYSYARNITSTDSSYOMYHQSEALAIASSQ 

S H LLGRDS PS av feqdlenk ems k e w flfjjds rvtfts fqs vqk 

I TSR FF KDTAYVLLYKKQHSTNGLSGNKPTSGLWINGDPPLQKE 
LMDA I T KDNKLY LQEQELN APJ^RALOAAS AS CS FRPNG FDDKDP 
PG S CGPTGGGGGGGFNTVGRLVF 


7018 


484 


1066 


S liV FRGNTWS GE AGHHCS ALFNLAAY KOLF VGTER I RAPE 1 1 FQ 
PSL1GEEQAGIAETL0YILDRYPKDV0EMLVQNVFLTGGNTMYP 
GMKARMEKELLEMRPFRSSFQV0LASNPVLDAWYGARDWALNH1» 
DDNEWITRKEYEEKGGEYLKEHCASNI YVPIRLPKQASRSSDA 
OASSXGSAAGGGGAGEQA 


7019 


1046 


335 


APGG FLVTMVFPAPS P PWMLGCCSHE VTAG P PTLCKDMS ALVAA 
RMRH I PLAPGSDWRDLPN I EVRLSDGTKARKLR YTHHDRKNGRS 
SSGALRGVCSCVEAGKACDPAAROFNTLIPWCLPHTGNRHNHWA 
GLYGRLEWDGFFSTTVTNPEPMGKQGRVLHPEQHRWSVRECAR 
SQGFPDTYRLFGNILDKHROVGNAVPPPLAKAIGLEIKLCMLAK 
ARES ASAKI KEEEAAKD 


7020 


1 


2154 


FADS KRKS VLLDK I KNi*QVALTS KQQ S LETAMSFVARNT FKRVR 
KGFLMRICVAVFFSNTPTRASPQLREAVLKLSDAGITPLFLTRQE 
DROLINALQINNTAVGHALVLPAGRDLTDFIjENVLTCHVCLDIC 
Ml DPS CGFGSWRPSFRDRRAAGSDVDI DHAF I LDSAETTTLFQF 
NEMKKYIAYLVRQLDMS PDPKASQH FAR VAWQHAPS ES VDNAS 
MPPVKVEFSLTDYGSKEKLVDFLSRGMTQLOGTRALGSAIEYTI 
ENVFESAPNPRDLKIWLMLTGEVPEOOLEEAQRVILQAKCKGY 
FF WLG IGR K VNI KEVYTFASEPND VF FKL VD KSTELNE E PLMR 
FGRLLPSFVSSENAFYjbSPDIRKOCDWFQGDQPTKNLVKFGHKQ 

vJHV rNlNV J 5or lSWrv i i J, I\rv 1 i 1 rvr V til I r^sr V 1111 rwr V L X 

I NO PS VKPAAAKPAPAKPVAAKP VATKTATVRPP VAVKPATAAK 
PVAAKPAAVR P PAAAAAKPVATKPEVPRPOAAKPAATKPATTKP 
MVKM S RE VQV F EITENS AKLHWER PEP FG P Y F Y DLTVTS AHDQS 
LVLKQNLTVTDRVIGGLLAGQTYHVAWCYLRSQVRATYHGSFS 
TKKSQPPPPOPARSASSSTINLMVSTEFLALTETDICKLPKDEG 
TCRD F I LKW YYDPNTKS CAR.FW YGG CGGK EN KFGSQ KECEK VCA 
PVLAKPGVISVMGT 


7021 


2 


336 


VNA VS F FPNG YAFATGS DDATCRLFDLRADQ E LLLYSHDN 1 1 CG 
ITSVAFSKSGRLLLAGYDDFNCNVWDTLKGDRAGVLAGHDNRVS 
CLG VTDDGMA VATGS WDS FLR I WN 


7022 


2 


856 


VYIGSFWSHPLLIPDKRKLFEAEEQDLFRDIQSLPRNAALRKLN 
DLIKRARLAKVHAYI1 S S LKKEMP S VFGKDN KKKELVNN1ASI Y 
GR I ER EHO I S PGDFPNLKRMQDQLQ AQDFS KFQ PLKS KLLEWD 
DMLAHDIAOLMVLVRQEESQRPIOMVKGGAFEGTLHGPFGHGYG 
EGAGEGIDDAEWWARDKPMYDEI FYTLSPVDGKITGANAKKEM 
VR S KL PNS VIiGK I WKLAD I DKDGMLDDDE FALANHLI KVKLEGH 
ELPNELPAHLLPPSKRKVAE 


7023 


2 


748 


AMV FGG WPY VPQYRD I RRTQNADG FSTY VCLVLLVAKI 1>R I LF 
WFGRR FES PLLWQSA1 M I LTMLLMLKLCTEVRVANELNARRRSF 
TAADSKDEEVKVAPRRSFLDFDPHHFWQWSSFSDYVQCVliAFTG 
VAGY1 TYLS 1 DSALFVETLGFIAVLTEAMLGV PQLYRNHRHQST 
EGMS 1 KMVLMWTSGDAFKTAY FLLKG AP LC FS VCGLLQVLVDLA 
I U3QAYAFARHPQKPAPHAVHPTGTKAL 


7024 


1207 


190 


RTGVTGWAQVWMFGGGGVLSSGEOLQMPVKPERGLGPSDGWLV 
SSRRGSPGTVLGi,pFWLLTPVLVSRSIRSMLLLTRSPTAWHRl*S 
OLKPPVLPGTLGGQALHLRSWLLSRQGPAETGGQGQPQGPGLRT 
RLLlTGLFGAGLGGAWLALRAEKERliOQOKRTEALRQAAVGQGD 
FHLLDHRGRARCKADFRGQWVLMYFGFTHC PD I CPDELEKLVQV 
VROLEAEPGLPPVQPVFITVDPERDDVEAMARYVQDFHPRLLGL 
TGSTKQVAQASHSYRVYYNAGPKDBDQDYIVDHSIA1YLLNPDG 
LFTDYYGRSRSAEQ3 SDSVRRHMAAFRSVLS 


7025 


232 


832 


ERNSPIGNNENL*K\HSLDCLCFKGBWEGNTQFCTLODNQEECF 
KOVIRTCEKRPTFNQHTVFNLHQRLNTGDKLNEFKELGKAFISG 
SDHTQHQL I HTS E KFCGDKECGNTFLPDSE VI QYQTVHTVKKTY 
ECK2CGKSFSLRSSLTGHKR1HTGEKPFKCKDCGKAFRPHSQLS 
VHKRIHTGEKSYECKECGKAFSCG 


7026 


328 


1146 


N PR P S 1GD 1 KDIKKAAXSMLDPAHKSHFHP VTPSLVF LCFIFDG 
LHOALLS VG VS KRS NTWGNENEERGT P YAS RFKDM FNF I ALE K 
SSVLRHCCDLblGVAAGSSDKICTSSLQVQRRFKAMMASIGRLS 
HGESADLL3SCNAESAIGWISSRPWVGELMFTFLFGDFESPLHK 
LRKS £ * LPRKHR * QP 3NAVRMFLDQCMDGS I ALRAI VSEI PVFE 
EKK"vNG*KGIGEIF*VWGCTLPPHYWGAVTTNVPKLSNSGKIjLG 
qdeqphifg 


7027 


43 


954 


GRRLQC?QORPEDAEDGAEGGGKRGEAGWEGGY?EIVKL*NKI>FEH 
YY0ELKIVPEGEKGQFMDALREPLPATLR2TGYKSHAKEILHCL 
KNKY F KELEDbEMDGQKVEVPQPLSWY PEELAWHTNLS RKILRK 
^Pl-I.FKPHDPl^V^PTP^GMT ^ROFAV^MT PPL.LLNVRPHHKILD 
MCAAP GS KTTQLI EMLHADMNVPFPEG FVI ANDVDNKR CYliLVH 
QAKR LSS PCI MWNHDAS S I PRLQI DVDGR KE 1 LFYDR 1 LCDVP 
CSGDGTMRKNIDVWKKWTTLNSLQbHGliOLRIATRGAEOL 


7028 


189 


608 


SRF P PEPE PGTMVEKGSDS S S EKGGVPGTPSTQSLGS KN KI RNS 
KKKCSWYSMLSPTYKQRNEDFRKLFSKLPEAERLIVDYSCALQR 
ElLbOGRLYLSENWI CFYSNI FRWETTI S10LKEVTCLKKEKTA 
KL1PNAIQ 


7029 


1343 


40 


VLES KTEAKQATGTS SKLRHGTGQE KGREGPRCPSGLAQLRLWG 
/PC PHAGRETGPRAS AP IPGS* GHG WH W * RKDGRGER S EG PSAL 
SPHSPSLLNMQOAPTHVGPGMGSORPRSSWPEQVGVGSOLSRE 

TRKRNSRRNTMVS RGGGCLLY PIjQS I M PE* QLR * GAHAS PPTQG 
R*GKGGPRSPLTKASGTTHI PTPFFGS1 P/RPTRDSGPGTDNS\ 
AAPGOKRGHREA*OGPEPV/WGRVTTHLOGPAG*TKPLGS\RNW 
VPGPAEGEQC-EGAGLEGRP* PLKGCRSTLTFSPQLSI PMVGKKP 
PEGTTASFFP\RSCHSE*RKPPPSCPHAPALSLPHPLPLPLPPL 
PL»F LPG AGT * HS AR S GRPG0 S ETGS LCHNCHH CPPHCPKCS PGG 
T 


7030 


2 


521 


FVCFS APGSGOGGKRRVKMELSAVGERVFAAEALLKRR I RKGRM 
EYL-VKWKGWSQKYSTWEPEEN I LDARLLAAFEERERHMELYGPX. 
KRGPKPKTFLLKA0AKAKAKTYEFRSDSARG2RIPYPGRSPQDL 
ASTSRAREGLRN\RVCPRQRAAP AF AAP \PRRGP SGPGPRPG* G 
PGLKFPGPGGPSKHGFVPASEQHQHQQKLPRRGPSGPGPRPG 


7031 


960 


59 


HCSVPGAEWPRKPPAQICPQLTSRPHLSSPRSLSPGCGKSPGPG 
/CKPS /RHCDELHEGPSRTAALPCGKPOPKHGVEECG/ PCPCLA 
PRRXTE P PALTVS P VGRAAP S GAL * PSGRACS ACSHRLA PE AAL 
SAAAPRPSLGSGQNASGLPAAS LPPQDS SOPHKTVPS PARSVP P 
liGAOARAAPPRLWCPRALVSG * EAS PEAVS VAAGPPVPGPTPST 
SGS TASH SRRGC* S PR * TPA P PRRDHGRS AAFEVLTAAASAQPC 
ASOGGPRPTGAGRTPSPLGLPFSRGPPAASARPFCRHPSL 


7032 


1393 


2104 


RRPGRTEPVEPPPVPPPPRASNSKSRCR* RNLHLAPL* QSPLRK 
SRQ3 GTSSLPFGRSAGERPRPAATFCLSRGGSSPVFL* PSSSSL 
EPWMKRQFGRLHSLPWKSWOKMNSFLLTPKLDTSLMSGWRYRQR 
LPRLHTFLKKSLQMASELAPPLPTPAPLASSLPPPPGPPPLLPV 
PLA*LSRSGILVPPNSGFSLSC\PLGDH*GSSGEVRGSCGSPPP 
HHCWVLPPPP*LLLPPR 


7033 


689 


815 


RSRDCLSSSATSNRARRSKCSGPKRATPLDSGPGP*APPGPSSA 
LMM P S S CP W RTGALGP S P AGS RALGRCTSS VGPGSRWL7RTS S P 
GCATR TWRTM R MEPR P LR S RMGE S APG J PAEL PS AA PS G P SAPS 
AAAPSAPTTPAAAGPNTL*SRRTAEWCWPPSCSCOfGWC*SWSA 
WDWRRPPLQVSPAPSSSCRASCCWCLESIT*SSSTARSRATGAS 
SSSTCPTSRSDRGAAWTP\SPMGAPLLPCSVPLISREEALODPR 
NPSP*GVCSGSSGHAGLALGKPPVACSVP 


7034 


92 


1942 


EDTSSMPFRLLIPLGLLCALLPQHHGAPGPDGSAPDFAHYRERV 
KAW7YHAYDSYLENAFPFDEbRPLTCDGHDTV3GSFSlTLlPAl J D 
TLL\ TLFYF0I LGNVSE FQR WE VLQDS VDFD I DVNAS VFETNI 
RWGGLLSAHLLSKKAGVEVEAGVJPCSGPLLRMAEEAARKLLPA 
FQTPTGMPYGTVNLLHGVNPGETPVTCTAGIGTFlVEFATbSSL 
TGDPVFEDVARVALMRLWESRSDIGLVGNHIDVLTGKWVAQDAG 
IGAG VDS Y FE Y1j VKGA I LLQDKKLMAM FLE YNKAI RNYTR FDDW 

TVWKOFGGLPEFYNIFOGYTVEKREGYPLRPELIESAMYLYRAT 
GDPTLLELGRDAVES I E K I S KVECGFATI KDLRDHKLDNRMES P 
FLAETVKYLYLLFDPTNF3 HNNGSTFDAVITPYGEC J LGAGGYI 
FNTEAHPIDPAALHCCORLKSEQWEVEDLMREFYSLKRSRSKF0 
KNTVSSGPWEPPARPGTLFSPENKDQARERKPAKQKVPLLSCPS 
QPFTS KLALLGQVFLDSS * PLDN F F I F I FLRLN YNKLLLA 1 1 KK 
K 


7035 


92 


1942 


EDTS S MP FRLL I PLGLLCA LL PQH HGA PG PDGS APD P AH Y R ER V 
KAMFYHAYDSYLENAFPFDELRPLTCDGHDTWGSFSLTLIDALD 
TLL\TLFYF0ILGNVSEF0RWEVL0DSVDFD1DVNASVFETN3 
RWGGLLSAHLLSKKAGVEVEAGWPCSGPLLRMAEEAARKLLPA 
FQTPTGMPYGTVNLLHGVNPGETPVTCTAGIGTFIVEFATLSSL 
TGDPVFEDVARVALMRLWESRSDIGLVGNHIDVLTGKWVAQDAG 
I GAG VDS YFE YLVKGA I LLQDKKLMAM FLE YNKAI RNYTR FDDW- 
YLWVyMYKGTVSMPVrOSLEAYWF^l/QSblbUl DNAMK i r L>N J i 
TVWK0FGGLPEFYNIPOGYTVEKREGYPLRPELIESAMYLYRAT 
GDPTLLELGRDAVES3 EK1 SKVECGFATI KDLRDHKLDNRMESF 

C"T HL'lwvr VT T PnDTMPTlI7Jltf/^CTPnZlVTTDV/^Pr*r T Y^firY^VT 

FNTEAHP I DPAALHCCORLKE EQWEVEDLMREPYSLKRSRS KFQ 
KNTVSSGPWEPPARPGTLFSPENHDQARERKPAKQKVPLLSCPS 
nPFTSinj\T.T/5£>vpr,n c ; , 3'* pt.dnpfiftflrlnynkijLlaiikk 
K 


7036 


442 


761 


CLAPLFSCFQIINLHLiAPSGRLRWAWLRGPGRN*LPGEGPSIPT 

RNW* ERKAGCSQPC/ paqqhhgrppgvs plprdphpttlrplpp 
ppppppppprrpprnrrpg 


7037 


442 


761 


CLAPLFSCFQI 1NLHLAPSGRLRWAWLRGPGRN * LPGEG P S 1 PT 
RNW*ERXAGCSQPC/PAQQHHGRPPGVSPL?RDPHPTTLRPLPP 
PPPPPPPPPRRPPRNRRPG 


7038 


155 


891 


GAGAASDMSSGLRAADFPRWKRHI SE0LRRRDRLQR0AFEEI IL 
Q YNKLLEKS DLHS VLA0 KLQAE KHDVPNRHE I S PGH DGTWNDNQ 
LQEMAOLRIKHQEELTELHKKRGELAQ\RVIDLNNQMQRKDREM 
QMNEAKIAECLQTISDLETECLDLRTKLCDLERANOTLKDEYDA 
LQITFTA1»EGKLRKTTEENC>ELVTRWMAEKAQEANRLNARE*KR 
LQEAAS PAAERACRS S KGTS TS RTG 


7039 


155 


891 


GAGAASDMSSGLRAADFPRWKRHI SEQLRRRDRLQRQAFEEI IL 
QYNKLLE KS DLHSVLA0 KLQAE KHDVPNRHE I S PGHDG T WNDNQ 
LQEMAQLR I KHQEELTELHKKRGELAQ \R V I DLNNQMQRKDREM 
QMNEAKIAECLQTISDLETECLDLRTKLCDLERANOTLKDEYDA 
I*QITFTALEGKIjRKTTEEN0ELVTRWMAEKAOEANRLrJARE*KR 
LQEAAS PAAERACRS S KG TSTS RTG 


7040 


34 


769 


KITPPRRPHRCSSGHGSDNSSVLSGELPPAMGKTALFYHSGGSS 
GYESVMRDSEATGSASSAODSTSENSSSVGGRCRSLKTPKKRSN 
PGSQRRRLI PALSLDTSSPVRKPPNSTGVRWVDGPLRSS FRGLG 
EPFE1 KVY E I DDVERLORRRGGAS KEAMCF NAKLKI LEKRQQR I 
AEVRAKYEWLMKELEATKQYLMLDPNKWLSEFDLEQVWELDSLE 
YLEALECVTERLESRVNFCKAHLMMITCFDIT 


7041 


2 


567 


SGRVAMGRRRAPAGG S LGRALMRHQTQRS RSHRKTDS KLHTS E L 
NDG YD WGRLNL-OS VTSOSSLDDFLATAELAGTE F VAE KLN 1 K FV 
PASARTGLLSFEESOR IKKLHEENKQFLC1 PRRPNWNOI^TTTPEE 
LK0AEKDNFLEWRR0L\VRLEEEQKLILTPFERNLDFWRCLWRV 
IERSD1WQIVDA 


7042 


7 


345 


PIHMAAAALRADI \ 1 S PLFPHlQGYLLLS/iSHG \ATS LKTKGAL 
PLETVTMYTV1PKSKYVLVKPDTQYPYSENLDEFKRLASNSASN 
DDLLMAEVAI S DYGDKLTLELREKY 


7043 


2 


2170 


ARGMAAR DSDS E EDLVS YGTGliEP LEEGER P K K P I P L0D0TVRD 
EKGRYKRFHGAFSGGFSAGYFNTVGSKEGWTPSTFVSSRQNRAD 
KS VLG PEDFMDEEDLSEFG I APKAI VTTDD FAS KTKDR1 KE KAR 
OLAAATAPI PGATLLDDLI TPAKLSVGFELLRKMGWKEGCX5VGP 
RVKRRPRRQKPDPGVKIYGCALPPGSSEGSEGEDDDYLPDNVTF 
APKDVTPVDFTPKDNTVHGLAYKGLDPHQAL FGTSGEH FN LFSGG 
S ERAGDLGE I GLNKGRKLG ISGQAFGVGALEESDDDI YATETLS 
KYDTVLKDEEPGDGLYGWTAPRQYKNQKES EKDLRYVGKI LDGF 
SLASKFLS SKKIYPPP ELPRDYRPVHYFRP MVAATSEN S HLLQV 
1»S ES AG KATPDPGTHS KHQLNASKRAELLGETP I QGSATS VLEF 
LSOKDKER I KEMKQATDLKAAQLKARSLAQK AQS SRAQPS PAAA 
AG H CS WKMA LGGGTAT LKASKF KP FAKDPE KQ KR YDE FL VKM KQ 
GQ K DALER CLD PS MT E WE RGRERDE FARAALL Y ASS H S TL S SR F 
THAXEEDDSDQVEVPRDQENDVGDKQSAVKKKMFGKLTRDTFEW 
HPDKLLFQ/RLVGLPRVKRDKYSVFNFTiTLPETASLPTTOASSE 
KVSQHRGPDKSRKPSRWDTSKHEKKEDSISEFLRLARSKAEPPK 
QQSSPLVNKEEEHAPELSAN 


7044 


276 


734 


EVY LTDEFAKGRKVADLYELVQYAGNI I PRLY LL ITVGWY VKS 
FPQSRKDILKDLVEMCRGVQHPLRGLFLRNYLLQCTRN I LPDEG 
EPTDEETTGD1 S DSMDFVLLNFAEMNKLKVRMQHCX3HSR EKR 
ERERQELR I LVG TNLVRLSQV 


7045 


3 


513 


LGFKMEALS RAGQEMS LAALKQHDP Y ITS1 ADLTGQVALY TFCP 
KANQWEKTD1 EGTLFVYRRSASPYHGFTI WRLNMHNLV E P VN K 
DLE FQLHEP F LLYRNAS LS I YS I W FYDKNDCH R I AKLMAD W EE 
ETR RS QQA/R SGQTE SQPGQWLQR PQAHRHPG D AEQSQG 


7046 


3 


513 


LGFKMEALSRAGQEMSLAALKQHDPYITSIADLTGQVALYTFCP 
KANOWEKTDIEGTLFVYRRSASPYHGFTIVNRLNS^HNLVEPVNK 
DLE FQLHEP FLLYRNAS LS I YS I W F YDKNDCK R I AKLMAB WEE 
ETRRSQQA/RSGQTESQPGQWLQRPQAHRHPGDAEQSQG 


7047 


103 


486 


0MK1 EX CGWS EGLTS I KGNCHNFYTAISKDVTY KELKJNLLN S KN 
IMLI DVREI WEI LEY0K1 PESINVPLDEVGEALQMNPRDF KEKY 
NEVKPS KSDS / 1 V FS Y LAG VRS KKALDTAI SLGFHS YYER 


7048 


92 


627 


FFCLTLLSSWDYRHHATRRVISSPVFTMEDSGKTFSSEEEEANY 
WKDLAI-1TYK0RAENT0EELREFQEGSREYEAELETQLQ01ETRN 
RDLLSENNRLRMELET 1 KEKFEVQHSEGYRQ1 SALEDDLA0TKA 
IKDQL0KYIRELEQANDDLERAKRATDHGLSKTFE\0RLK\0AI 
EKKW 


7049 


393 


938 


KRTGSASYGGPPPGLGGPATXASVAGRCSSVGKIPARRCYEDEL 
VPVFEAVGR I YELRLMMDFDGKNRGYAFVMYCKKHEAKRAVREL 
NN YE 1 R PGRLLG VC CS VDNCR1/FI GG I PKMKKR EE I LEE I A KVT 
EGVLDV I VYASAADKMKNRGLRLRG VREPPRGCHWLGRKL1 AVJX 
ASSLWG 


7050 


393 


938 


KRTGSASYGGPPPGLGGPATXASVAGRCSSVGK2PARRCYEDEL 
VPVFEAVGR I YELR LMMDFDGKNRG YAFVMYCHKHEAKRAVREL 
NNYEIRPGRLLGVCCSVDNCRLFIGG1PKMKKREE1LEEIAKVT 
EGVLDVIVYASAADKMKNRGLRLRGVREPPRGOWLGRKL1AV7X 
ASSLWG 


7051 


119 


816 


KKMNbAEICDNAKKGREYALLGNYDSSMVYYQGVMOOIORHCQS 

vrdpaikgkwqqvr0ellbeyeqvks1vgtlesfk1dkppdfpv 
sc0depfrdpavwpfpvpaehrapp0irr/rosrsktseerngr 
srspgtcrpstNpisksekpstsrdkdyrargrddkgrknmodg 
asdgemfkfdgagydkdiivealerdivsrnpsihwddiadleea 
kkllreagvlpmwk 


7052 


467 


715 


SCPGRGKMSKLLNPEEMTSRDYYFDSYAHFGIHEEMLKDEVRTL 
TYRNSMYHNKHVFKDKVVLDVGSGTGILSMFAARQGPRR 


7053 


467 


715 


SCPGRGKMSKLLNPEEMTSRDYYFDSYAKFGIHEEMLKDEVRTL 
TYRNS MY HNKHVFKDKWbDVGSGTG 1 LSMFAARQGPRR 


7054 




1036 


CVP<OR^RFTn&RRRCikfi&pPTZU5T,PWDZXAT»PFWP<^rPCPPT/3Pf2 

RRCRWDAWEYDEKLARFRQAHLNPFNKOSGPROHEOGPGEEVPD 
VTPEEALPELPPGEPEFRCPERVMDUSbSEDHFSRPVGLFLASD 
VQQLROAI EECKQV I LELPEQSEK0KDAWRLIHLRLKLOELKD 
PNEDEPN I R VLLEHR F Y KEKSKS VKQTCDKCNT I IWGLl QTWYT 

DYRCAECRAP I / CS/ DGWPSEAROCDY TGQYYCSHCHVJNDLAV 
I FAR WHNWD FEPRK VS RCSMRYLALh5VSR PVLRLRE1 N 


7055 


2 


527 


DSRRVSWRSWIJ^NE/'aGKHLCLFIWLSMNVLLFWKTFLLYNQGP 
EYHYLKO^a^/ALCLSRASASVL^^L^3CSLXLLPMCRTLlAYX,RG 
^OKA/P^RRTRRT T.ntfCETPtf TTrP-JxTT PT F^fJVHVBJXHT.VMAT.Nf 

FSVNYSSDFVELNAARYRDEDPRKLLFTTVPGLTGVCMEWIiFL 
K 


7056 


2 


527 


DSRR VS WRS WLANE/WG KHLCLF1 WLSMNVLLFWKTFLLYNQG P 
EYHYLH0MU5/AbCLSRASASVLNLNCSLILLPMCRTLLAYLRG 
SOKVPSRRTRRiLDKSKTFHITCGATIClFSGVHVAAHLVNALN 
FS VNY S E D FVELNAARY RDEDPR K LL FTT VPGLTG VCME WliFL 
M 


7057 


136£ 


431 



G I YT.-HVN E K 1 P R PTC 1 GDR f)E NDK F N 1 jN NH R DO E LLiHA S COA 
SGEVPSQASLRGFFTEDEPGCFGEGENLPEALQNIODEGTGEQL 
S PQER I S EKQLGQHLPN PHSG EMSTM WLE E KRETS 0 KGQ PRAPM 
AQKLPTCRECGKTFYRN SQLI FHQRTHTGET Y FOCT I CKKAFLR 
5SDFVKHQRTHTGEKPCKCDYCGKGFSDFSGLRHKEKIHTGEKP 
YKCP3CEKSFI0RSNFMRKQRVHTGEKPYKCSHCGKSFSWSSSL 
DKHC>R SH LGKKPFQ * PVTKLSFP2 SIS DPSHKNTQLHQEELCLR 
GYPC 


7058 


3 


469 


FS GFGAV PDAiiGCRMSDLRITEAFLYNDYLCFRAl.CC KG PP PAR 
PEYDLVC1 GLTGSGKTSLLSKLCSESPDNWSTTGFS IXAVPFQ 
NAILNVKELGGADN1 RKYWSRYYOGSQGVI FVLDSASSEDDLEA 
ARN*SCTOLLQHPQLCTLPFLILA 


7059 


1 


1178 


WPA FPRQ PAA.WMDALLGTG PRRARG CLGAAG PTS SGRAARTPA 
APWARPSAWLECVCWTFDLELGQALELVYPNDFRLTDKEKSS1 
CYLSFPDSHSGCLGDT0FSFRMROCGG0RSPWHADDRHYWSRAP 
VALOREPAH YFGY VY FR QVKDSSVXRG YFQKSLVLVS RLPFVRL 
FQALLSLIAPEYFDKLAPCLEAVCSE1DQV7PAPAPGQTLNLPVM 
G WV0VR I PS R VDKSES S PPKQFDQENLLPAPWLAS VHELDLF 
RCFRPVLTHMQTLWELMLl^EPLLVIiAPSPDVSSEMVLALTSCL 
OPLRFCCDFRPYFT IHDSEFKEFTTRTQAP PNWLGVTNP FF1K 
TLQHWPHILRVGEPKMSGDLPKQVKLKKPFKV*RPWDTKP 


7060 


90 


1670 


SVNLPPSLWPWEEAMDSTKSEPLKGSFEAEDGNIEYKKLVNPSQ 
YRFEHLVTOM KWRLQEGRGEAVYQI GVEDNGLLVGLAEEEMRAS 



LKTLHRMAE KVGAD I T VLRERE VD YDSDK PRK1 TEVLVRKVPDN 
00FLDLRVAVLGKVDSGKSTLLGVLTQGELDMGRGRARLNL7RH 

RT*DGGRVWRLFEIV*MMALRGL*TSSAPLRKSMGNQLN*IKN6 
VKIKROGKPGNGLGPGNSEGVGRAGRRH^GPWALXSQWNYSDSR 
7AEE 1CESSSKMITFI DLAGHHKY LHTT 3 FGLTS YCPDCALLLV 
S ANTG I AGTTREHIjGLALALKVPFFI W£ KI DLCAKTTVERTVR 
QLERVLKQPGCHKVFMLVTSEDDAVTAAOOFAOSPNVTPIFTLS 

PEVGTWGGTLSR* IDLliATLPTQPSPI YSKTSWPKGGDPG 3 


7061 


364 


710 


ARMPSPLGPPCLPVMDPETTLEEPETARL-RFRGFCyQEVAGPRE 
ALARLRELCCQWLOFEAHSKEQMLEMLVLEOFLGTLPPEIQAWV 
RGQR PGS PEEAAAL VEGLQHDP * ARMPS F LG P PCLPVMDPETTL 
EEPETARLRFRGFCYQEVAGPREAIiARLRELCCOWLOPEAKSKE 
QMLSML VLE QFLGTL P PE I 0AWVRG0RPG S P E EAAAL VEGLC/HD 
PGOLLG 


7062 


71 


744 


AKAGTNLERLHWLSyFFCIPKHKLKSSQKDKVROFMACTQAGER 
T AI Y CL.TQN EWRLDE ATDSFFQH PDSLHR ES MRN AVDKKKLE RL 
YGRYKDP0DENKIGVDGIQQFCDDLSLDPAS1SVLV3AWXFRAA 
TOCEFSRKEFLDGMTELGCDSMEKLKALIPRIjEOELKDTAKFKD 
F YQPTFTFAXNPGQKGLDL* MAGAYWKLVLSG R FKFL Y LWNT FL 
MEHH 


7063 


2 


562 


LRTVPDLPGRRFRAMRTGQRR*PELPPDKNSLE0AED1»KAFERR 
LTEYIHCLQPATGRWRMLLIWSVCTATGAWNWLIDPETQKVEF 
FTS LWNH P FFT I SC 1 TLIGLFFAG I H KRWA P S 1 1 AAR CRT VIA 
EYNMS CDDTG KLI LKPR PHVQ* QS S LIVKGLK I AFLR I SDTAKS 
HKGFLLRLDM 


7064 


300 


684 


RDTGSDPSSTRRLCSTCCTGH*PAEPIA£PHPSRGTCPPASSAS 
SRRTGCKTCPPESGHAQARRSRRASASRWGARGAVRSAVAARGC 
SbRAGKWLfcTPGKKKGP PAUVAAAGKlikC.-.t' Ar * AAPP 1 AbVPAR 
CRCPAARTGAPAAATV3LRRRLSGLRAPALGRRRSPGPSPKSAAP 
PLLTPLGAGRAGGSRANS 


7065 


1 


555 


AJ I lHSAKKbGKGAAAEAAAoAAGGKyKOhJJKi^WbCsAKJ I PvjCj 
RSOSEPKAPPPQKRSEAAFASMAKSPVAVCVPGMQNNIADPEEL 
FTKLERIGKGSFGEVFKGIDNRTQOWAi KI IDLEEAEDEIEDI 
G<)EITVLSOCDSSyVTKYyGSYLKGSKI J Vs 1 33^!lEYl J GGGSALDLL 
RAGPFDEFQ 


7066 


356 


676 


FGPQRGPWRAREGGH PLDPADH FRAPAS LR SNVRAATKMQ I CDT 
YNOKHSLFNAMNRFIGAVNNMDQTVMVPSLLRDVPLADPGLDND 


7067 


152 


973 


KEN 1 TMATE 3 GS P PR FFHM PRFQKQAPROL F Y KR PDFAOQQAMQ 
QLTFDGKRMR KAVNRKT 3 DYNPS VI KYL ENR I WQ R DQR DMRA I Q 
PDAGYYNDI.iV P P I GMLNNPMNAVTTKFWTSTNKVKCPVFVVRW 
TPEGRRLVTGASSGEFTLWUGLTFNFET3 L0AHDS PVRAMTWSH 
^3^MVJM^3TAnPGG VVK Y WOSNMATNVKMFOA *-"KEA " REARFI HN T P 
FS WP I VMVKLFS XC I LGAEMHGLCQFLGNFIiHP INTI FFFVFT 
HSPFCWAPF 


7068 


222 


816 


DTMKEYVLLLFLAIjCSAKPFFSPSHIALKNMMLKDMED'J'DDDDD 
DDDDDDDDDDEDNSLFPTREPRSHFFPFELFPMCPFGCQCYSRV 
VHCSDLGLTSVPTNIPFDTRMLDLQNNKIKE1KEWDFKGLTSLY 
GLILNNNKLTKIHPKAFLTTKKLRRLYLSHNQLSEIPLNLPKSL 
AELR3HENKVKKIQKDTFKKK 


7069 


1147 


1765 


FKDHRRYFyVNEOSGESQWBFPDGEEEEEESQAOENRDETLAKQ 
TLKDKTGTDSNS7ESSETSTGSLCKESFSG0VSSSSLMPLTPFW 
TLLOS2^VPVLOPPLPLEMPPPPPPPPES?PPPPPPPPAPKMPPP 



EKTKKGRKDKAKKSKTKMPSLVKKWQS1QRELDSEDNSSSSEED 
RVSTAQKRIEEWKQQQLVSGKAERNANFEA 


7070 




547 


DGTWEDSEAVORATALIEORIAOEEENEKLRGDARQKLPMDLLV 
LEDEKHHGAQSAAbQKVKGOERVRKTSLDLRREIIDVGGIQNLI 
ELRlOtRXgKKRDALAASHEPPPEPEEITGPVDEETFLKAAVEGK 
MKV1EKFLADGGSADTCDQFRRTALHRASLEGHME1LEKLLDNG 
ATVDFQ 


7071 




021 


ARGTLRALETAKKVGKVGANGOKAAGPSADSVTENKIGSPPKTP 
VSNVAATSAGPSNVGTELNSVPOKSSPFLTRVPAyPPHSENlQY 
FQDPRTQIPFEVPQYPQTGYYPPPPTVPAGVAPCVPRFVRSNNV 
PESShPPASMPYADHYSTFSPRDRMNSSPYQPPPPQPYGPVPPV 
PSGMYAPVYDSRRIWRPPMYCRDDIIRSNSLPPMDVMHSSVYQT 
SLRERYNSLDGYYSVACQPPSEPRTTVPLPREPCGHLKTSCEEQ 
1RRKPDQWAQYHTQKAPLVSSTLPVATQSPTPPSTLNRGEGS 


7072 




921 


ARGTLPJVLET AKKVG KVG ANG0KAAG PS ADS VTENKI GS P PKTP 
VSNVAATSAGPSNVGTELNSVPQKSSPFLTRVPAYPPHSENIQY 
FQDPRTQIPFEVPQYPQTGYYFPPPTVPAGVAPCVPRFVRSNNV 
PESSLPPASNiPYADHYSTFSPRDRMNSSPVCPPPPQPYGPVpPV 
PSGMYAPVYDSRRIWRPPMYQRDDJIRSNSLPPMDVMHSSVYQT 
SLRERYNSLDGYYSVACOPPSEPRTTVPLPREPCGHL.KTSCEEQ 
IRRKPDQWAQYHTQKAPLVSSTLPVATQSPTPPSTLNRGEGS 


7073 


50 


504 


LAHGSFGVSDFPAPAAAPAHTLTSFSGSLSPOFRKPLGRAPAMP 
LVRYRKW3LGYRCVGKTSLAKOFVEGEFSEGYDPTVEKTYSKI 
VTLGKDEFHLHLVDTAGODEYS I LPYS FI I GVHGYVLVYSVTSL 
HSFQVIESLYQKLHEGHGK 


7074 


263 


1003 


VCPVLCSTRQEPGHSSLVTYFGKPTRRKEFLLGHCIAAGKNNIS 
VDLETNYAELVLDVGRVTLGENSRKKMKDCKLRKKQNERVSRAM 
CALLNSGGGVI KAE I ENEDYS YTKDGIGL.DLENS FSNI LLFVPE 
YLDFMQNGNYFLI FVKS WSLNTSGLR I TTLS SNTjYKRDI TSAKV 
M^ATAALEFLKDMKKTRGRLYIiR PELLAKRPRVDI QEENNMKAL 
AGVFFDRTEIJDRKEKLrFTESTHVEl 


7075 


596 


1005 


NY1NFFFRKEYPPHVQKVEINPVRLSRL0GVERIMKKTEESESQ 
VEPEJKRKVQQKRHCSTyQPTPPLSPASKKCLTKLBDLORNCRQ 
AITLNESTGPLLRTS I HQNSGGQKSQNTGLTTKKFYGNNVEKVP 
1DII 


7076 


279 


1049 


LOSESSNAAEGNEQRHEDEQRSKRGGWSKGRKRKKPLRDSNAPK 
S PLTGYVR FMNERREQLRAKRPEV PFPE I TRMLGNEWS KLP PEE 
KQRYLDEADRDKER Y MKELEQYQ XTEAY KVF S RKTQDRQKGKSK 
RQDAARQATHDHEKETEVKERSVFDIPIFTEEFLNHSKAREAEL 
R0LRKSNMEFEERNAAL0KHVESMRTAVEKLEVDV1QERSRNTV 
LQOHLETLRQVLTSSFASMPLPEXGETPTVDTIDSYM 


7077 


3 


1119 


SSMGSNSEINGIiALRKTDKYGFLGGSOYSGSLKSSlPVDVARQR 
ELKWLDMFSNWDKWLSRRFOKVKLRCRKGIPSSLRAKAWOYLSN 
SKELLEONPRKFEELERAPGDPKWLDVIEKDLHROFPFHEMFAA 
RGGHGQODLYRILKAYTIYRPDEGYCQAOAPVAAVLLMHMPAEQ 
AFWCLV01CDKYLPGYYSAGLEAIOLDGEIFFALLRRASPLAHR 
H LRRQR I DP VLYMTE W FMC I FARTLPWAS VLR VWDMFFCEG VK2 
IFRVALVLLRHTLGSVEKLRSCOGMYETMEOIiRNLPQOCMOEDF 
LVHEVTNLPVTEALIERENAAOLKKWRETRGELOYRPSRRLHGS 
RAIHEERRRQQPPLGPSSS 


7078 


483 


767 


FOGQRMAGEQKPSSNIiLEQFlLLAKGTSGSALTALISQVLEAPG 
\ryWGELLElANVOELAEGANAAYLQI>LNl>FAYGTYPDYIANKE 
SL»PELY 


7079 


_ — 


376 


SWEFKRPKEPSGSDGESDGP1DVGQEGQLSQMARPLSTPSSSQ 
N0ARKKRRGIIEKRRRDRINSSL5ELRRLVPTAFEKOGSSKLEK 



AEVL0MTVDHLKMLHATGGTGTHALLF0ASFIQ02 F 


7080 


200 


595 


VQbFbEAPCbSbbSCRDHSGGNREbSRKHRDCRVYGSPQDGlPY 
LTHPLCH QD WSVGRLQI RALAT PGHTOGH LVYXLDG E PYKGPS 
CLFSGDLbFLSGCGEFPRKREEbGEEGETEVRAATVPWRALKP " 


7081 


21 3 


506 


AVTEEEM I LNS LSLC YHKKL3 LAPMVRVG?b?MRLLALDVGADI 
VYCEELIDLKMIQCKRWNEVLSTVDFVAFDDRWFRTCEREON 
RWFQMGTS 


7082 


3 


1137 


APSRNTMLMAWCRGPVLLCLRGGLGTNSFLHGLGQEPFEGARSL 

SSLENLRLPTLREESSPRELEDSSGDQGRCGPTHOGSEDPSMLS 
0AQSATEVEERHVSPSCSTSRERPF0AGE1ILAETGEGETKFKK 
LFRLNNFGLLNSNWGAVPFGK2VGKFPG0JLRSSFGK0YMLRRP 
AliED Y W JUMKRGTA1 TFPKD J NM J JjoMTCD i N PGJJTv JjEAGSGMS 

GMSLFLSKAVGSQGRVISFEVRKDHHDLAKKNYKHWRDSWKLSH 
VEEWPDNVDFI HKDISGATED3 K S LT FDAV ALDMLK PH VTLPVF 
YPHLKHGGVCPVYWNIT0V3 ELLD 


7083 


lit 


54 3 


RSNAVQLTRMEYAMKSLSLLYPKSLSRHVSVRTSWTOQLLSEP 
S PKAPRARPCRVSTADRSVRKG I MAY SLEDLLLKVKDTLMLADK 
P FFLVLEEDGTT VETEE Y FQ AL AGDT VFM VLQ KGQ KWQ P PS EQG 
TRHPbSbSHK 


7084


3 


522 


NSVSVSSOSRFLASVPGTGVCRSAAADMAASTAAGKQR I PKVAK 
VKNKAPAEVQ I TAEQLLREAK KR EbELLF PPPQOK 1 TDE EELND 
Y KbRKRKTFEDNI RKNRTV I SN W I KYAQVJE ESLKE I ORARS I YE 
RALDVDYRNITbWLKYAEMEMKNROVNHARNI WDRA3 TTL 


7085 


243 


1499 


RQLARbRRRG WRS PFGGAPMAH I T I NQYLQQVYEAI DS RDGASC 
AEbVS FKHPHVANPRbQMAS PE EKCQQVLE PP YDEMFAAHbRCT 
YAVGNHDF I EA YKCQTV1 VQS F LRAFOAH KE ENWAb P VM YAVAL 
DbRV FANNADQQLVKKG KS KVGDMbEKAAELbMSCFRVCASDTR 
AGIEDSKKWGMLFbVNQLFKIYFK3NKLHl>CKPbIRAIDSSNbK 
DDYSTAQKVTYKY i VGRK/YMFDSJJr KUAbEYLtSr AF bHLHRb£>y 
KKKRMILIYbLPVKMbbGHMPTVEIjbKKyHLMQFAEVTRAVSEG 
MLbLLHEALAKHEAFFIRCGIFblbEKLKUTYRNbFKKVYLbb 
KTKQLSLDAFLVAbKFMQVEDVDI DEVQC3 bANbl YWGHVKGYI 
SHQHQKbWSKQNPFPPLSTGC 


7086






SGHbS MEEFKKI YGNFFP.YGDAS KFAEHV FRT FDANGDGT I DFR 
EF 


7087 


166 


723 


LSGS SAGKVAAPCVPPSNHELVP1 TTENAF KNWDKGEGASRGG 
NTRKSbEDNGSTRVTPSVQPKbQPlRNMSVSRTMEDSCEbDLVY 

ERRPD I TKbKAKVbEFGWPDbKTPALE KICSI CKAMDTWLNAHP 
HRCRVbHNKG 


7088 


2 04 


759 


GTSAASPSSbbEi4AGEITETGELYSSYVGbVYMFNbJVGTGAbT 
MP KAFATAGWbVS bVLbVFbGFMS FMTTTFV I E AMAAAN AQbHW 
KRMENbKEEEDDDSSTASDSDVblRDNYSRAEKRPIbSVQRRGS 
PNP FE I TDRVEMG QMAS MFFN K VG VNb FY FCI I VYbY GDLAI YA 
AAVPFS LMQVTCSATGNDS CG VE ADTK YNDTDRCWG PbRRVD 


7089 


33 


1775 


S VC WE DRY LKARME ES PbS RAP S RGG VN FbNVART Y I PNTKVEC 
RYTLPPG'l-MPSASDWIGIFKVEAACVRDYHTFVWSSVPESTTDG 
SP 1HTS VQFQAS Y bPKPGAQbY QFR YVNROGQVCG OS P PFQFRE 
PRPKDELVTbEEADGGSDIbLWPKATVbOKQLDESQQERNDbM 
QbKbObEGQVTELRSRVQEbERAbATARQEHTEbMEOYKGISRS 
HGEITEERDIbSROOGDHVARILEbEDDIOTlSEKVbTKEVEbD 
RbRDTV KALTREQE KLbGQbKE VQADKEQS E AELQ VAQCENHHb 
NLDbKEAKSWOEEOSAQAQRLKDKVAOMKDTLGQAOQRVAEbEP 



LKEQLRGA0ELAASSQ0KATLLGEELASAAAARDRT1AELHRSR 
LEVAEVNGKLAELGLHLKEEKCQWSKERAGLLQSVEAEKDKIbK 

LSAEILRLEKAVOEERTQNQVFKTELAREKDSSLVQLSESKREE
TELRSALRVLQKEKEOLQEEKOELLEYMRKLEARLEKVADEKMN
EDATTEDEEAAVGLSCPAALTDSEDSSPEDMRLHPKIAFVSVETQ
ASLLLGLE


7090


33 


1775 


SVCWEDRYLKARMEESPLSRAPSRGGVNFLNVARTYIPNTKVEC 
KYTLPPGTMPSASDWIGIFKVEAACVRJDYHTFVWSSVPESTTDG 
SPIHTSVQFOASYLPKPGAOLYQFRYVNRQGQVCGQSPPFQFRE 
FRPMDELVTLEEADGGSDILLWPKATVLQNQLDESOQERNDLM 
0LKLQbEGQVTELRSRVQSLERALATARQEHTEU4E0YKGISRS 
HGEITEERD1LSRQ0GDHVAR1LELEDDIQTISEKVLTKEVELD 

NLDLKEAKSWOEEOSA0A0RLKDKVAOMKDTLG0AO0RVAELEP 
L) KEQIaRG AQELAAS SQQKATLLGE EIaAS AAAARDRT 1 AELHRS R 
LEVAEVNGKLAELGLHLKEEKCOWSKERAGLLQSVEAEKDKILK 
LSAEILRLEKAVQEERTQNQVFKTEUu7EKJ)SSLVQLSESKREL 
TEbRSALRVLOKSKEQLOEEKQELLEYMRKLEARLEKVADEKWK 
EDATTEDEEAAVGLSCPAALTDSEDESPEDMRLHPKAFVSVETO 


7091 


186 


1076 


EGMLTREHRCGRSEEOELEPWPSPKKARSGRWLRNGFKRKMEEP 
EEPADSGQSLVP VY I YSP EYVSMCDS LAKI PKRA5MVHS bl EAY 

SI EYGLGY DCPATEGI FDY AAA1 GG ATI TAAQCLI DGMCKVAIN 

KGDGVEDAFS FTS KVMTVSLHKFS PGFFPGTGDVS DVGLGKGR Y 
YSVNVPIQDGIQDEKYYOICERYEPPAPNPGL 


7092 


522 


809 


KOGINEjDQEESQKFRLGEGCEPISKRQMKKLIKOKQVJEEQRELR 
KOKRKFKRKRKKLFROrOMFPN^DGHnRKRVRRDVVHSTLRTirT 
DCSFBXLM 


7093 


454 


655 


NFGVSGVELAOOASMVRMSFVIAACQLVLGLLMTSLTESSIQNS 
ECPQLCVCEIRPWFTPOSTYREA 


7094 


2 


50& 


PVR ^MHWOVCiFA <i ^RPPWTM ^WNOQ T ^iFFRWWAG^F FP F ^ FYG 

DI 1 AFPLQDYGG I MAGU5SDPWWKKTLYLTGGALLAAAAYLLHE 
LLVIRKCX)EIDS KDAI ILKQFARPNNGVPSLSPFCLKMETYLRM 
ADLPYONYFGGKLSAOGKMPVJI EYNHEKVSGTEFI 1 


7095 


1 


411 


IASSLPKMASLLCSDRVJjYLVQGEKKVRAPLSQLYFCRYCSELR 
SLECVSHEVDSHYCPSCLENMPSAEAKLKKNRCANCFDCPGCMH 
TLSTRATSISTQLPDDPAKTTMKKAYY1ACGFCRWTSKDVGMAD 
KSVGE 


7096 


224 


2067 


ETRSLAVQEKPS OAGRRR SSRI SFAGALFLTRFLLQELLLNNFC 
SAMSPAPDAAPAPAS1 SLFDLSADAPVFQGIjSLVSHAPGEAliAR 
APRTSCSGSGERESPERKLLOGPMDISEKLFCSTCDQTFONHQE 
QR E H Y K&D WHRFNL KQRL KDKPLLS ALD FEKQS S TGDLS S I SG S 
EDSDSASEEDLOTLDRERATFEKJL.SRPPGFYPHRVLFCNAQGQF 
LYAYRC^GPHQDPPEEAELLLQNLQSKGPRDCVVLKAAAGHFA 
GAI FQGRE WTHKTFHR Y TVRAKRGTAQGLRDARGGPSHSAGAN 
LRRYNEATLYKDVRDLLAGPSWAKALEEAGT3LLRAPRSGRSLF 
FGG KGAPLQRGD PRLWD2 PIATRRPTFQELQRVLHKLTTLHVYE 
EDPREAVRLHSPOTHWKTVREERKKPTEEEIRKICRDEKEALGO 
NEESPKQGSGSEGEDGF0VE1.ELVELTVGTLPLCESEVLPKRRR 
RKRNKKEKSRDOEAGAHRTLUQQl'OEEEPSTQSSOAVAAPliGPL 
LDEAKAPGQPELW2CALLAACRAGDVGVLKLQLAPSPADPRVLSL 
LSAPIiGSGGFTLLHAAAAAGRGSVVRLLLEAGADPTVQCQDH 


7097 


255 


1228 


IRTKSAATWEAWPQCGREGSRIITEPCEANAGSRQELOTERISS 
FLAAQGDOAFHSGLETNKSNS ELPLR VGLKVAQG S PLMGGQVS A 



SN S FSRIjHCRN AN3SDWMS ALCPRLWDVPliHHbS I PGS HDTMTYC 
LNKKSP ISHEES RLLQLLNKALPCI TRPVVLKWSVTQALDVTEQ 
LDAG VRYLDLR I AHMLEG S E KNLH FVHMVYTTALVE DTLTE 1 S E 
WLERHPREWILACRNFEGLSEDLHEYLVACIKNIFGDMUCPRG 
E V PTLRQLVI S RGQQV IVSVEDESS LRRHK ELW PGVP YV? W GNRVK 
TEALIRYLETMKSCGR 


7098 


62 


956 


SSFLKRCRKVLGCWG1PSECSLFSTI.EEPRDKEIDNYCVMRLQT 
EAR S GFWAPNR F P VN I CRM7AVDGDRGGSS RE TCRCH F H PS LEA 
LVLLLQDWQPGGVG1 CTS FLG1 S W ALLDYHRALRTCLPS KPLLG 
LGSSVIYFLWNU.LLWPRVLAVALFSALFPSYVALHFLGLWLVL 
LLW VWLQGTD FK PDPS S E WL Y RVTVATI LY FS W FNVAEG RTRGR 
AI IHFAFLLSDS I LLVATWVTHSSWLPSG1 PLQLWLFVGCGCFF
IjGLALRLVYYHWLHPSCCWKPDPDQVD 


7099


992 


210 


LFRLAPGFLRSLARQGYHQIWAFPFLPSGATATWPAASRSRSLA 
ARSLPRSPARPGPr3DALLGEHDFRGC?GVRAQRFRFSEEPGPGAD 

R0SCQMNNLPHL0\A/G1*TWGH ISWDLLALPPODI ILASDVFFEP 
EDFEDILATIYFLMHKNPKVOLWSTYQVRSADWSLEALLYKWDM i 
KCVHI PLESFPADKEDI AESTLPGRHTVEMLVI SFAKDSL 


7100 


205 


671 


MlMour riC*nnr\jC! 1j voJjx^ijVW V tr JftonoM i AvuVj.!. wo/wr r itXIJjo VJu 

FLF S FPPQLGD F L>E AF PV FK KY DRNGLNVS 1 E CKRVSGLEPATV : 
DWAFDLTKTNMQTMYEOS E WGWKDREKREEMTDDRAWYL I AWEN 
SS VP VAFSHFR F D VERGDE VLYW 


7101 


2 


503 


VR1LLVGEPRVGKTSL1 MSLVSEEFPEEVPPRAEEIT1 PADVTP 
ERVPTHI VDYSE A3QSDEQLHQE 1 SQANV I C I VYAVNNKHS I DK 
VTSRWI PLlNERTDKDSRlvPIjl LGGKKSDLVE YSR 


7102 


2 


502 


WRGGPRRAKRLAGGAVGWVLLWGVHSVRAGGGRPPRAADMKKD 
VRILLVGEPRVGKTSLIMSLVSEEFPEEVPPRAEEIT1 PADVTP 
ERVPTHI VT3YSEAEQSDE0LHQEISQANVI CI VYAVNNKHSIDK 
VTSRWIPLIKERTDKDSRLPL3LGGNKSDLVEYSR 


7103 




436 


GSOSSVAVNIRSGTDEESMDLMNGQASSVNIAATASEKSSSSES 
LSDKGSELKKSFDAWFDVLKVTPEEYAGQITLMDVPVFKAIQP 
DELS S CGWNKKE K Y$ SAP 


7104 


1670 


795 


RLWEHRSVSAGAS GVJGLS S PGCLLLHP SLP EEERVDI LI NNAGV 
MRCPHWTTEDGFEMQFGVNHLGEAWAGAAP WVQAI LPRR P PK VL 
GF*V* VKSDLFI 3 LN PGHFLLTNLLLDKLKAS AP SRI I N LS S LA 
HVAGH IDFDDLNWQTRKYNTKAAYCQS \KLAI VLFTKELSRRLQ 
GSGVTVNALHPGVARTEIjGRHTGIKGSTFLOHHN\WAJILLAAWS 
KSPRS WPAPAQHNTLAVAEELA\ VISGKYFDGLKQKA PAP EAED 
EEVARRLWAESARLVGLEAPSVREQPLPR 


7105 


765 


143 


GOMCRRPSPKST? CLSMTCDLP/ RGliQDPQCLALFRVAVDKHOA 
LLKAAMSGQGVDK HLFALY I VSRFLHLQS PFLTQVHSEQWOLST 
SQI PVQQMHLFDVHNYPDYVS SGGGFGPADDHG YGVS Y I FMGDG 
M I TFH I S SKKS S T KTDSHRLGQH I EDALLDV AS LFQAGCH FKRR 
FRGSGKENSRHR CGFLS R0TGAS KASMTSTDF 


7106


14 


1064 


GLQAGHPHPRSAS RIPEADTH \ YSKLQRAFDS I VNKDHKRMFGT 
YFRVGFFGSKFGDL,DEQEFVyKEPAITKLFEISHRLEAFYGOCF 
GAEFVEVIKDST P VDKTKLDPN KAY IQITFVEPY FDEY EMKDRV 
TYFEKNFNLPJiFMYTTPFTLEGRPRGELHEQYRPJJTVLTTMHAF 
PYIKTRISVIQKEEFVLTPIEVAIEDMKKKTLQLAVAINOEPPD 
AKMIX3MVLQGS VG ATVNQGPLE VAQVFLAE I PADPKLYRKKNKL 
RLCFKEFI MRCGEAVEKNKRLI TADQREYQQELXKNYNKLKENL 
RPMIERKI PELYKP I FRVESQKRDS FHRSS FRKCETQ LS QGS 


7107 


1145 


591 


*1*WLQTGKKK 





7108 




94 2 


VK VA Tj I ,L,TN LE 0 P R TE p V* ENS FTI , KMFT iFOFVNLiNS ^ ^ 17 Y T £ F 
FLGRFTGHPGAVLRLINRWRLEECHPSGCIjIDLCMQMGI ■ HVLK 
OTWWFMELGyPLIONWWTRRKVROEHGPERKISFPQWEKDYNL 
C'PMNAYGLFDEVLEMILOFGFTTI FVAAFPLAPLLALLNNI I El 
R LD A YK FVTQWRR PLAS RA KD I G I W YG I LEG 1 G I LS VI TKAF VI 
A3 TSDFI PRLVYAYK YGP CAGQGEAGQKGWG Y VNASLS V FR I S 
DFENRSEPESDGSEFSGTPLKYCRYRDYRDPPHSIjVPYGyTLQF 


7109 


964 


102 


K DQRKRNSLVPG P AHGPAQEEPWEKKES LGAAQEALS 1 Q1»Q?KE 

TOPFPKSEQVYLHFLSWTEDGPEPKDKGSLPQPPITEVESQVF 

SE KLATDTSTFE ATSEGTLELQQRN P KAERLRW S P AQEE S F RQM 

W J H KE I P TGKKDKECSECG KT FI YNSHLWHQ R VHEGE K P Y KC 

rnpp vtcv r\c ctmt r^ourvo ^ UTprvovcrwcrnv ktrrjwci'C'ui \;r\u 
olJv.t3A.JL r A.VobriJLi\jV'»V'**<* Vj&^jrr J&v,WD\.\5is > /\r i\wvj/\JiJLjv vri 

CR3HSGEKPYECNECGKAFSQSSYLSQHRR1HSGEKPF3CKECG 

KAYGWCSEL1RHRRVHARKEPSH 


7110 


96 


697 


R LDN FSG FL VE VTK EERH I V KPl>Y DR YR LVKQM LTRAS3 T P VLG 

KTA VQ VCSS LKNS ESDVE ENQEKLALDLRLSS S RAASMPELLEQ 
LWKARAEKKKLRKTLREFEEAFYOQNGRNAOKEDRVPVLEEYRE 

i K jr. J Knt\L>HL>Likt Vij 1 £> rSJUziis KSs 4. 


7111 


2 


414 


GSGLYRGPTPGGOCIWKPNSMPPDHERNFGFTCFALELNELTAE 
LKRSLPSTDTRLRPD0RYLEEGN10AAEA0KRRIEOLORDRRKV 
MEElWIVKOA^FFRRsyTDSSGKEWWVTNNTYWRLRAEPGyGNMD 
GAVLW 


7112 


103 


495 


PRCFPVADRGRLIGGLPDWTIMEGKTLNLTCTVFGNPDPEVIW 
FKKDQD10LSEHFSVKVE0AKYVSMTIKGVTSEDSGKYS 1 KIXN 
I\.Ytjijfc.K.lUw I Vov i tviit3t.J\l rUMAr ryyA-Vr-vLi-- f/\orvi/vftov 


7113 


1 


824 


KCLRQAWHEAPS S 1AFTRWCSREERAEGGGNLHRS ITRDPKPPG 
URF SQRPKDDKKKKRSPKPCIAQPAGAPGTLRRV P VPTS HSGSL* 
ALGLPHLP SF KQRAKFKRVG KEKGR PV LAGGG SG SAGTP LQH SF 
I/T E VTDVY EMEGG LbNLjLNDFHSGRLQAFGKECS FEQLE KV REM 
OEKLARLH FSLDVCGEEEDDE EEEDGVTEG LPE EQKKTMADRNL 
DOLLSNLGS CLGALVPGGMRGGEGTY SQSHSKALGEKVG WAG SK 
SSGPLNLPRR 


7114 


3 


1492 


VKEVDE0I DHYKESQDKFliWQAAFl GKETLKDESGQECKD CKK1 
IYLNTDFVSVKQRLPKYYSKERCSKHHLNFLGONRSYVRKKDDG 
CKAYWKVCLHYNLHKAQPAERFFDPNQRGKALH0KQA1.RKSQRS 
0 TG EKJL.YK CTECG K VFIQKANLVVHQRTHTGEKP YECCE CA KAF 
C^VCJTT T^UO'DTUTrPK'DVTTQFrVJKTFTOKQTI.IlfHOR'rHTGE 

OVlVO 1 XiXJ\ri\Jl<. i ti 1 \jtLl\Mr 1 £.UoC>V.UiV X I Xyiw 1 Li± AJ1VI\ J" J V3C» 

K PFVCDKC PKAFK S S YHL I RHE KTHI RQAFY KG I KCTTS Shi YQ 
R1HTSEKP0CSEHGKA5DEKPSPTKHWRTHTKENIYECSKCGKS 
FRGKSHliSVHORIHTGEKPYECSICGKTFSGKSHLSVHHRTKTG 
EKPYECRRCGKAFGEKSTIiIVHQRMHTGEKPYKCNECGKAFSEK 
SPLIKHQRIHTGERPYECTDCKKAFSRKSTLIKHORIHTGEKPY 
KCSECGKAFSVKSTLIVHHRTHTGEKPYECRDCGKAFSGKSTLI 
KH0RSHTGDKNL 


7115


\ 


947 


WEnawnYTTrfr;! .wpmv t t ppnnwi»nRGDESAPIRTPAMIGCSFWD 
REYFGDIGLLDPGMEVYGGENVKLGMRVWQCGGSMEVLPCSRVA 
HIERTRKPYlWD!DYYA]OvNALRAAEWMDDFKSH\rimWKIPM 
SNPGVDFGDVSERlAbRQRljKCRSFKVrxLENVYPEMRVYl^TLT 
YGEVRNSKASAYCLD0GAEDGDRA1LYPCHGMSSQLVRYSADGL 
LQLGPLGSTAFLPDSKCLVDDGTGRMPTLKXCEDVARPTQKLWD 
FTOSGPIVSRATGRCLEVEMSKDANFGLRLVVQRCSGQKl^lRN 
WIKHARfi 


7116 


866 


95 


RVRMRRNAEVIEEKLSMKSWAKFRPGEPWKGYPNIDPETDPYA'T 
PGSVINNLS INTVREVDHLRDRNSGSSSSLNTTLPSTSAWSS IR
ASNYNVPLSSTAQSTSARNSDSKLTWSPGSVTNTSLAHELWKVF 
LPPKNITAPSRPPPGLTGOKFPLSTWDNSPLRIGGGWGNSDAKY 
TPGSSWGESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPL3T 
FHLNLPHGNALVRYSSKEEWKAOKSLHISDLFLLTL- 


7117 


695 


1261 


LLISTPGGCKPPPSSIEFTYTGAWGKALPAPHMPCAFGALPOGA 
FVS0AARA2FLLQPS0AAOAEGLSOPARACGALCSLPWPLRNKG 
S P I LRLPGGLRTPTNDRKTRTRS AMACWARAQWDTLG P LKLSKR 
GKVCLRHPRPl-GVRGGPGAAGRQGGMGTRRRGTFTSGARDPGGlj 
RVKHRCQPTGHLP 


7118 


4S 


1863 


FH C E PN PG AG AM V LLH V I j? E KAVG Y ALLALKE VE E 3 S LLQPQVE 
ES VLNLG KFH S I VR LVA FC P FAS SQ VALENANAV S EG WHE DL>R 
LLLETHLPSKKKKVLLGVGDPKIGAAIQEELGYNCOTGGVIAr: 
LRGVRLHFHNLVKGLTDLSACKAQLGLGHSYSRAKVKFNVNRVD 
NM1 IQSISLLD0LDKD1NTFSMRVREWYGYHFPELVKI INDNA7 
YCRLAQFIGKRRELNEDKbEKLEELTMDGAKAKAILDASRSSMG 

AL I G E AVG AR h I AHAG S LTN LAKY PA STVQ 1 LG A E KALFRALK T 
RGNTPKYGLI FHSTFJGRAAAKNKGRISRYLANKCSl ASRIDCF 
SEVPTSVFGEKLREQVEERLSFYETGEIPRKNLDVMKEAMVQAE 
EAAAEITRKLEKQEKKRLKKEKKRLAALALASSENSSSTPEECE 
EMSEKPKKKKKQKPQEVPQENGKEDPSISFSKPKKKKSFSKEEL 
MSSDLEETAGSTS1PKRKKSTPKEETVNDPEEAGKRSGSKKKRK 
FSKEEPVSSGPEEAAGKSSSKKKKKFHKASQED 


7119 


49 


1863 


PHCEPN PGAGAMVLLH VLFEHAVGYAL.LAJLKE VE E J SLLQPQ V v. 
ES VLNLGKFKS I VRLVAFCP FAS SQ VA LENANAVS EG WH EDLR 
I.bLETHLPSKKKKVLLGVGDFKIGAAIQEELGYNCQTGGVlAE: 
LRGVRLHFHKLVKGLTDLSACKAQLGIjGHSYSRAXVKFNVNRVI: 
NM1I0SISLLD0LDKDINTFSMRVREWYGYHFPELVKIINDNAT 
yCRLAQFlGNRRELNEDKLEKLEELTMDGAKAKAILDASRSSNjG 

AL1 GEAVGARL I AHAG S LTNLiAK Y PAS TVQ I LG AE KALFRALKT 
RGNTPKYGLI FHSTFIGRAAAKNKGRI SRYLANKCS J ASRIDCF 
SEVPTSVFGEKLREQVEERLSFYETGEIPRKNLDVMKEAMVQAE 
E AAAE ITRKLE KQEKKRLKKEKKRLAALALAS S ENS S S TPEECE 
EMSEKPKKKKKQKPQEVPQENGMEDPS I SFSKPKKKKS PSKEEL 
MS SDLEETAGS TS I PKRKKSTP KEETVNDPEEAGH'K S GS KKKR K 
F S K EE P VS SG F E E AAGKS S S K KK KKFHKASQED 


7120 


1S91 


64 


QLGTRRCI^GDKVTNAMQDFLVTNLEPRFIEPQTAKI.SVVFKDS 
NSTTPL I FVLS PGTDPAADL YKFAEEMKFSKKLS A I S LGQGQG F 
RAEAMMRSS1 EKGKWVFFQNCHLAPSWMPALERLI EK1NPDKVH 
RDFRLWLTSLPSNKFPVSILONGSKMTIEPPRGVRWaLLKSYSf 
LGEDFLNSCKKVMEFKSLLLSLCLFHGNALERRKFGPLGFNIPY 
EFTDGDLRI CIS QL KMFLDE YDD I P Y KVLKYTAG E 1 NYGGRVTD 
DWDRRCIMNlLEDFYNPDVLSPEHSYSASGIYHOlPFTYDLHGy 
LSYIKSLPLNDKPEIFGLHDNANITFAQNETFALLGT1IQLQPK 
S S SAGS QGREE I VED VTQN I LLKVPEP 1NLQWVMA KY P VLYEES 
MNTVLVQEVIRYNRLLQV1T0TLQDLLKALKGLVVKSSQLELKA 
AS LYNNTVPELWS AKA Y PS LKPLS S WVMDLLQRLD FLO AW I QDG 
IPAVFWISGFFFPOAFLTGTLQNFARKFVISIDTISFDFKVMFE 
APSELTQRPQVGCYIHGLFLEGARWDPEAFOLAES0PKELYTEM 
A V I WLli PTPNR KAQD QDFYLC P I YKTLTRAGTLS T TGH STNYV j 
AVEI PTHQPQRHWI KRGVAL1 CALDY 


7121 


2 


546 


RPLRPWVLSLGSMVGLMTYGRRQFQSLDTTMRKL1 PF FREASAK 
LTTLVDADAEAFTAYLEAMRLPKNTPEEKDRRTAALOEGLRRAV 
S VP LTLAET V AS LW P ALQ ELAR CGNLACRS DLQVAAKALEMG V F 
GAY FNVL I NLRD I TDEAFKDO I HKRVS S LLQEAKTQAALVLDCL
ETRQE 


7122 


2 


54 6 


R PLR PV? V LSLG S1WGLMTYGR RQ FQS LDTTMR RL1 P PFR EAS AK 
LTTLVDA DAE A FTA YL EAMRL F KN TP EE KDR R TAALQEG LRRAV 
S V PLTLAET VAS LWP ALQELAR CGNIACRS D LQV AAKALEMG VF 
GAYFNVLINLRDITDEAFKD03HHRYSSLL0EAKTOAALVLDCL 
ETRQE 


7123 


3 


1092 


KPAVPEARSAGTSEAGRSGAEEVSCGSVSGDGAAMRLTPRALCS 
r\ T\ rift h uidc went /""r^Dni r 7\ DUCrruMi\ v/^T i/UMncci VI tfnpTTC 

VKDARI PLSGRW PLFQETUSLKPHLLVLNKMDLADLTEQOKIMQ 
HLEGEGLKKVI FTNCVKDENVKQ 1 1 PWVTELIGRSHRYHRKENL 

MSKI QVSERPLMFLLDTPGVUAPR I ESVETGLKIjALCGTVLDHL 
VGEETMAD YLL Y TLNKHQRFG Y VQH YGLGS A C DNVERVLKS VAV 
KLG KTQKVKVLTGTGNVNVIOPNYPAAARDFLQTFRRGLLGSVM 
LDLDVLRGKPRV 


7124 


2 


382 


LPLTLLLAAPFAHLLLPPGHD£>S PCWHPGPALS PGTLGPLSWAM 
ANSGLQLLGY FLADGG WVGI I ASTALPQWKQS S YAGDAS I QLRS 
KV FV LESEWGGD S LGLPRDCG WS C LLH S AVR SE KG F WS 


7125 


166 


1127 


NCISEKKNYSFSMQKGKGRTSRIRRRKLCGSSESRGVNESHKSE 
F I ELR K WIjKAR K FQDSNIiAPACF PG TGRGLMSQTS LQEGQM I 1 S 
LPESCLUT\RDT VIRSYLGAY 3 TKWKPPPSFLLALCTFLVSEKH 
AGKRSLLEA\YbEILPKAYTCPVCLEPEWNLLPKSLKAKAEEO 
RAHVQEFFASSRDFFSSLQPLFAEAVDSIFSYSALLWAWCTVNT 
RAVYL\ SPG SGN AFLQS RTPV QliAP YLDLLNHS PHV QVKAAFNE 
ETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPH 
ACVYVSRGWKQLCS 


7126 


1 


733 


CRDMAAFI VPS PARRCSQKGSLGHLPTQPV?LWAAMS PRGOERGT 
SHSOAREPORPGRWLLGSLQSSPGTLGQAGTASRRRGCMVQRWV 
0 VATGRRAVOV P KGALGLALGETS PG AS RGMS GGAGGCWALGWA 
PSPVLPSWLLEGPPPWLS1ISDSGTQRPSPRRCPARPSPWGPQC 
WRGGRIASAEASST*TPGSGSRARSGRRSPGSRRRSASAPSPTP 
PTDACA* SCVAR PAGSR SSRPAAA 


7127 


1312 


277 


GLPAMCST*KAGYYEETEGDC1PKDR* IEKRPFKE1 * RRIPRIF 
AKQKQ I * S *NS QKIGAS EIDRGR KEADCSDAPAAAR I GAVSVFR 
RSTOEARVSPRSNAKSANLRAVRAD*V?EHFVLLFHTPEQFIiAEC 
I CRST* * K*V?H03-»C* PLSSL* TGliKRKLLL* VLFRI *WLKDCDV 
* FCOKI FATNFCNWQNLIQ* EE* KPVEYSVEN* HIMNLLLPM+L 
COS S LRDQT I VT WRM* RN YSM FR 2 NM I S SI** DGSIHI PLKLHFY 
PALI FTLTVPI NS CCQRPLPLFAHQS 3 KTLAS SGS PMLACLRFL 
LVKKRAFIHTPRSPGCSV*CKHVX>VKDNKNNCVGSEV 


7128


2 


5228 


GRVDLWTILLGRSALRELSQIEAELNKHWRRLLEGLSYYKPPSP 
SSAEKVKMKDVASPLKELGLR3SKFU5LDEEOSVQJ..LQCYLQE 
DYRGTRDSVKT VLQDERQSQALI LKI AD YYYEERTC I LRCVLHL 
LTYFODERHPYRVEYADCVDKLEKELVSKYRQOFEELYKTEAPT 
WETHGNLMTERQVSRWFVQCLREOSMLLEI I FLYYAYFEMAPSD 
LLVLTKMFKEQG FGSRQTNRHLVDETMDPFVDRIGY FSALILVE 
GMDI ESLHKCALDDRRELHQFA0DGL1 CQDMDCLMLTFGD I PHH 
AP VLIAWALLRHTLNPEETSSWR KI GGTAI QLNVFQ Y LTRLLQ 
SIjASGGNDCTTSTACMCVYGLLS FVLTSLEIiHTLGNQQDI I DTA 
CEVLADPSLPELFWGTEPTSGLG I ILDSVCGMFPHLLSPLLQLL 
RAL VSGK5 TAKK V YS FLDKMS FY NEL YKHK PHDVI SHEDGTLWR 
ROTPKLLYPLGGCTNLRIPQGTVGQVMLDDRAYI.VRWEYSYSSW 
TLFTCE I EMLLHWSTADVI QHCQR VKP I 1 DLVHKV I STDLS I A 
DCI^PITSRIYWLLQRLTTVISPPVDVIASCVNCLTVI^AARNPA 
KVWTDLRHTGFLPFVAHPVSSLS QMI S AEGMNAGGYGNLLMNSE 
OP0GEYGVTIAFLRLITTLVKG0LGSTQS0GLVPCVMFVLKEMI.
PSYHKWRYNSHGVREQIGCLILELIHAILNLCHETDLHSSHTPS 
LOFLClCSLAYTEAGOTVINIMGIGVDTIDI^VViAAQPRSDGAEG 
QGQGQLLI KTVKLAFS VTNNV I RLKFPSNW S PLEQALSQHGAH 
GNNLIAVLAKYIYHKHDPALPRLAIOLLKRLATVAPMSVYACLG 
NDAAAI RDAFLTRLQS K\ I E \DMR I K\ VM I L\ EFLT VA\ VETQP 
GLIELFLNLEVKDG\SDGSKEFSLGMW\SCLKAV/VWELIDSQO 
0DRYWCPPLLHRAA3AFLHALWQDRRDSAMLVLRTKPKFWENLT 
SPLFGTLSPPSETSEPSILETCALlMKIICLEIYYWKGSbDQP 
LKDTLKKFSIEKRFAYWSGYVKSLAVHVAETEGSSCTSLLEYOM 
LVSAWRMLLIIATTHADIMHLTDSWRRQLFLDVLDGTKALLI.V 
P AS VN CLRLGS MKCTLLLI LLR0W KRE LGS VD E I LG P LTE I LEG 
VLQADQQLMEKT KAK V FS AF I TVLQMKEMKVS D I PQ Y S QL V LNV 
CETLOEEVIALFDQTRHSLALGSATEDKDSMETDDCSRSRHRDQ 
RDGVCVLGLHLAKELCEVDEDGDS WLQVTRRLP 3 LPTLLTTLEV 
SLRMKQNLHFTE ATLHLLLTLARTQQGATAVAGAGI TQS 1 CLPL 
LSVYQLSTNGTAQTPSASRKSLDAPSWPGVYKLSMSLMEQLLKT 
LRYNFLPEALDFVGVHOERTLOCLNAVRTVQSLACLEEADHTVG 
FILQLSNFMKEWHFHLPQLMRDIQVNLGYLCC'ACTSFLHSRKML 
QHYLQN KNGDGLPS AV \AQRV \ QR PPS AAS AA PS SSK0P A7VDTE 
AS EQOALHTVQYGLLKI LS KTLAALRH FTPD VCQ ILLDQ S LDJjA 
EYNFLFALSFTTPTFDSEVAPSFGTLLATVNVALNMLGELDKKK 
E PLTQAVGLS TQAEG TRTLKS LLMFTMENCFY LLISQAMRYLRD 
PAVH PRDKQRM KQELS SELSTLLS SLS R YFR K G APS S PATG VLP 
SPQGKSTSLSKASPESQEPLIQLVQAFVRHMQR 


7129


1 


1054 


FRRFRWRRRLH'AGPASSAGGSPGEASGTMSGELPPNINIKEPR 
WDQSTFIGRANHFFTVTDPRNILLTNEQLESAKKIVHDYRQGIV 
PPGLTENELWRAKYI YDSAFHPDTGEKMILIGnMSAQVPMNMI 1 
TGCMMTFYRTTPAVLFWQWINQSFNAVVNYTKRSGDAPbTVNEL 
GTAYVSATTGAVATALGLNALTKHVSPLIGRFVPFAAVAAANCI 
N I PLMRQR ELKVG I PVTDENGNRLGESANAAKQAITQVWSRI L 
MAAPGMAI PP FI MNTLEKKAFLKRFPWMSAP 1 0 VGLVG FCLVFA 
TPLCCALFPQKSSMSVTSLEAELOAKIQESHPELRRVYFNKGL 


7130 


2 


760 


HEVPSLQTSDPLPbSVyRCSVvVbQPNjUiNWCOU^ 
ISAKSQPYHRSQSSSSVLINKSMDSINYPSDVGKQQLLSLHRSS 

or^xrcti^r^r T Tjr\T AnourMVTu vt cm tt i~st^o nv\r\AAnJDXn .DT .TJ 
K^i^^ti\^Uui^t'UlJ\Uatl\J\Jsji CmISAjqUIj I jJulJSv** " v V vr**ci>»Jjr\Lww 

AQIATQNYFSNFKETDGDEDDYVEIKSEEDESELELSHNRRRKS 

DSKFVDADFSDNVCSGNTLHSLNSPRTPKKPVNSKLGLSPYLTP 


7131 


805 


573 


AAAEGHIEWKFLIEACiCVNPFAKDRWGNrPLDDAVQFNKLEVV 
KLLQDYQDSYTLSETQAEAAAEALSKENLESMV 


7132 


1420 


1067 


IDMLLLSGALVSGPYTLITTAVSADLGTHKSLKGNAHALSTVTA 
1 1 DGTGSVGAALGPLLAGLLS PSGWSNVFYMLK FADACALLFL I 
RL IHKELSCPGSATGDQVP FKEQ 


7133 


2 


3648 


001PGLLPAHGESGDALRKPRLQKPITGHLDDLFFTLYPSLEKF 
EEELLELHVQDHFQEGCGPLDGGALEILERRLRVGVHNGLGFVQ 
RPQVWLVPEMDVAliTRSAS FS RKWS SS KTS S GSQAL VLRS Rl» 
RL PEMVGHPAFAVI FQLEYVFS S PAGVDGNAAS VTSLSNLACMH 
MVRWAVWNPLLEADSGRVTLPLQGGIQPNPSKCLVYKVPSASMS 
SEEVKQVESGTLRFQFSLGSEEHLDAPTEPVSGPKVERRPSRKP 
PTSPSSPPAPVPRVLAAPQNSPVGPGLSISQLAASPRSPTQHCL 
ARPTSQLPHGSQAS PAQAQEFPLEAGI SHLEADLSQTSLVLETS 
IAEQLQELPFTPLHAPIWGTQTRSSAGQPSRASMVLLOSSGFP 
EILDANKQPAEAVSATEPVTFNPQKBESDCLOSNEMVLQFLAFS 
R VAQDCRGTS W PKTVY FTFQF YR FP P ATTPRLC L VQLDE AGQ P S 
SGALTHILVPVSRDGTFDAGSPGFOLRYMVGFGFLKPGERRCFA 
RYLAVQTLQI DVWDGDSLLLI GSAAVQMKHLLRQGRPAVQASHE
LE W ATEYEQDNM W SGDMLGFGRVKPI GVHS WKGRLHLTLAK 
VGHPCEQKVRGCSTLPPSRSRVISNDGASRFSGGSLLTTGSSRR 
XHWQAQKIJ^VDSEIAAMLLTHAROGKGPQDVSRESDATRRRK 
LERMRSVRLOEAGGDLGRRGTSVLAQOSVRTOHLRDLQVIAAYR 
ERTKAES I ASLLSLAI TTEHTLHATLG VAEFFEFVLKNPHNTQH 
TVTVEIDNPELSV1VDSQEWRDPKGAAGLKTPVEEDMFHLRGSL 
APQLYLRPHETAHVPFKFQSFSAGOLAMVOASPGLSNEKGMDAV 
S PWKS S AVPTKJIAKVLFRASGGKP I AVbCLTVELQPHWDQVFR 
FYHPELSFLKKAIRLPPWKTFPGAPVGMLGEDPPVHVRCSDPNV 
1CETQNVGPGEPRD1 FLKVASGPSPEIKDFFVI I YSDRWLATPT 
OTWOVYLHSLORVDVSCVAGQLTRLSLVLRGTQTVRKVRAFTSH 
PQELKTDPKGVFVLPPRGVODLHVGVRPLRAGSRFVHLNLVDVD 
CHOLVAS WLVCLCCRQPL J S KAFE IMLAAGEGKGVNKRI TYTNP 
YPSRRTFHLHSDHPELLRFREDSFQVGGGETYTIGLQFAPSQRV 
GEEEJLIY1NDHEDKNEEAFCVKVIYQ 


7134 


2115 


111 j 


GGEGFSYPPHVGLSLGTPLDPHYVLLEVHYDNPTYEEGLIDNSG 
LRLFY TMD1 RKY DAGVI EAGLWVSLFHTI PPGMPEFQSEGHCTL 
ECLEE ALE AE K P S G I HVFA V L LHAHLAG RG I R LRHFRKG KEMKL 
LAYDDDFDFNFOEFQYLKEEOTILPGD^LITECRYNTKDRAEMT 
WGGLSTRSEMCLS yLLYYPRINLTRCASl PDIMEQLQFIGVKEI 
YRPVTTWPFUKGP KQ Y KN hS FMDAMNKFKWT KKEGLS FN KL VL 
SLPVNVRCSKTDNAEWS 1 OGMTALPPDI ERPYKAEPLVCGTSSS 
SSLHRDFS 1KLLV CLLLLS CTLSTKSL 


7135 


2 


2072 


FVPRVTPRSLSLOGFKGESVGSITQPLPSSYLIFRAASESDGRC 
WLDALEIALRCSS LLRLGTCKPGRDGEPGTS PDAS PSSLCGLPA 
SATVHPD0DLFP1.KGSSLE1?DAFSDKSERENPEESDTETQDHSR 
KrESGSDOSETPGAPVRRGTTYVEQVQEELGELGEASQVErvSE 
ENKSLMWTLLKQLRPGMDLSRVVLPTFVLEPRSFLNKLSDYYYH 
ADLLSRAAVEEDAYSRMKLVLRWYLSGFYKKPKGIKKPYNPILG 
ETFRCCW FHPQTD S R T F Y 1 AE 0 VSHH P P VS A FH VS NRKDG FC IS 
GS ITAXSRFYGNS LS ALLDG KATLTFLNRAEDYTLTMPYAHCKG 
I LYGTMTLELGGKVTI ECAKKN FQAQLEFKLKPFFGGSTS I NQI 
SGKITSGEEVLASLSGHV3DRDVFIKEEGSGSSALFWTPSGEVRR 
QRLROHTVPLEEOTELESERLWQHVTRAISKGDQHRATQEKFAL 
EEAQRQRARERQESLMPWKPQ1.FHLDPITQEWHYRYEDHSPWDP 
LKDIAQFEQDGlLRTLiQQEAVARQTTFLGSPGPRHERSGPDORL 
RKASDQPSGHSQATESSGSTPESCPELSDEEQDGDFVPGGESPC 
PRCRKEARRLQALKEAILS 1 R EAQQELHRHLSAMLS STARAAQA 
PTPGLLOS PRSWFLLCVFLAC0LFINHI1>K 


7136


2


41fc 


DFVPfcFRKPSGJN 2 EQTVWIjLRAATIjEKEVAGIjREKIHHLDDMLK 
S0ORK\ r RQMIE0LONSKAVl 0SKDAT1 QBLKEKXAYLEAENLEM 
HDRMEHL1EKQISHGNFSTOARAKTENPGSIR1SKPPSPKPMPV 
IRWET 


7137 


2 


a a c 


GSFKVATQERNPORAOMRLRROKKGWPFLGDFLTELQRLDSAI 
PDDLDGNTNKRSKEVRVLOEMOLLQVAAMNYRlrRPLEKFXTYFT 
P mpot . n y p c vu I , <i rriT .p p pn p 


7138


2 


46(. 


WASGMSTVPGGSRHSLGIOVRGGWGVTGGEEESLTVPVADTW0A 
GSFKVATCERNPORAQMRLRR QKKG VVP FLGDFLTELQRLDS A I 
PDDLIXSNTNKRSKEVRVLOEMQLLQVAAI^YRLRPLEKFVTYFT 
RMEQLSDKESYKLS CQLEPENP 


7139 


1 


357 


SLRNSARGLKMAASAARGAAALRRSINQPVAFVRRIPWTAASSQ 
LK EHFAQFGHVRR C I L PFDK E TG FHRG LG WVQFS SEEGLRNALQ 
QENHI IDGVKVQVKTRRPKLPOTSDDEKKDF 


7140 


1401 


1S57 


RASSI^VLKAWGGLIPSSFQOOHTGOYALEELFDLKVYDCFCSF 
NMNVS LE KOLRP S C P W PRGKCR KTPGWEE ARP KAQDLRGDLG KT
0AGPAEAHTRGPPRLPAATGCPPHLPGLLSGISVD1DPTGLQSQ 
WTPKGQDPPLMFSEDYOKSLLEOYHLGLDOKLRKyWGBLlMNF 
ADFMTNQCG 


7141 


124 


1073 


LDSRSCWLDMEDLEEDVRFIVDETLDFGGLSPSDSREEEDITVL. 
VTPEKPLRRGLSHRSDPNAVAPAP0GVRLSLGPLSPEKLEE1LD 

VRDLLPTVNSLTRSTPS /LKQPDASTPE * * *EGVSQGSPGYI WK 
EALQHEEGVTHLQSVPCIOKPSIFSS\SRSTPPVRGRAGPSGRA 
AASEETRAAKLRGAAAKSSCOLP I PSA1 PRPASRMPLTSRSVPP 

f3Rf5ZVT.T>Pn<51 .^TBV^l.DRP^TlknHRVRPQnUin/DArQnDT TvTT.D\rM 
\jr{.\jJ\Ltr trUO Jjo 1 IvlV^JJr'K^^ 1 AunKvK50oHAVrvoyKLI\IJjrvI»i 

GATRSNLQPP 


7142 


658 


B39 


hi FLMLHMEliKMbS S VTLHIRAFLYW I CLKPTSClil FQNVbNLL 
KK* SRAVGWWMCRT/ YSSDLQVGVI KPWLLLGSQDAAHDLDT 
LKKNKVTHILNVAYGVENAFLSDFTYKSISILDLPETNILSYFP
ECFEFIEEAKRKDGWLVHCNA 


7143


J 


1 /J 


SijbWbblAar.PLibKWDShDb 1 i: ST1 MD VDbTI£>SL>RSTPAMMNGQ 
GSTTSSSKN1AYNCCWDQCOACFNSSPDLADHIRSIHVDGQRGG
VFVCLWKGCKVYNTPSTS0SWU3RHMLTHSGDKPFKCWGGCNA 
SFASQGGLARHVPTHFSQONSSKVSSOPKAKEESPSKAGMNKRR
KLKNKRRRS1ARPHDFFDAQTLDAIRHRAICFNLSAH1ESLGKG 
HSWFHSTVSILLFFQIKYKTLQKNISTIISKSLKI 


7144 


1 


988 


FRVNMQDGGPSPAEHSKAEESAGMEARFLGLPDAAGSSGPTPAR 
RCPAPRPAGVSYVIRDEVEKYNRNGVNALQLDPALNRLFTAGRD

QT TPTW < 5VNnVTK , nnPYT ZiCJMPHPTnUVMriTVri.PPMnifTT.T Clcq 
ji "O V IN II IN.V,/ \J t I A.M.O l lJw tltt J. t/W vimu J. V DV^^INVj tv. 1 uionOO 

DTTVKVWNAHKGFCMSTLRTHKDYVKALAYAKDKELVASAGLDR
OIFLWDVNTLTT^TASNNTVTTSSLSGNKDSIYSLAMNQLGTII 
VSGSTEKVLRVWDPRTCAKLMKLKGHTDNVKALLLNRDGTQCLS 
GSSDGTIRLWSLGQORCIATYRVHDEGVWALQVNDAFTHVYSGG 
RDRKIYCTDLRNPDIRVLICE



WHAT IS CLAIMED IS:

1. An isolated polynucleotide comprising a nucleotide sequence selected from the 
group consisting of SEQ ID NO:1-1786 and 3573-5358, a mature protein coding portion 
of SEQ ID NO:1-1786 and 3573-5358, an active domain of SEQ ID NO:1-1786 and 
3573-5358, and complementary sequences thereof.

2. An isolated polynucleotide encoding a polypeptide with biological activity, 
wherein said polynucleotide hybridizes to the polynucleotide of claim 1 under stringent 
hybridization conditions. 

3. An isolated polynucleotide encoding a polypeptide with biological activity, 
wherein said polynucleotide has greater than about 90% sequence identity with the 
polynucleotide of claim 1.

4. The polynucleotide of claim 1 wherein said polynucleotide is DNA. 

5. An isolated polynucleotide of claim 1 wherein said polynucleotide comprises the 
complementary sequences. 

6. A vector comprising the polynucleotide of claim 1.

7. An expression vector comprising the polynucleotide of claim 1.

8. A host cell genetically engineered to comprise the polynucleotide of claim 1.

9. A host cell genetically engineered to comprise the polynucleotide of claim 3 
operatively associated with a regulatory sequence that modulates expression of the 
polynucleotide in the host cell. 

10. An isolated polypeptide, wherein the polypeptide is selected from the group 
consisting of: 

(a) a polypeptide encoded by any one of the polynucleotides of claim 1; and

(b) a polypeptide encoded by a polynucleotide hybridizing under stringent 
conditions with any one of SEQ ID NO:1-1786 and 3573-5358. 

11. A composition comprising the polypeptide of claim 10 and a carrier.

12. An antibody directed against the polypeptide of claim 10. 

13. A method for detecting the polynucleotide of claim 1 in a sample, comprising:

a) contacting the sample with a compound that binds to and forms a 
complex with the polynucleotide of claim 1 for a period sufficient to form the complex; 
and 

b) detecting the complex, so that if a complex is detected, the 
polynucleotide of claim 1 is detected. 

14. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample under stringent hybridization conditions 
with nucleic acid primers that anneal to the polynucleotide of claim 1 under such 
conditions; 

b) amplifying a product comprising at least a portion of the 
polynucleotide of claim 1; and 

c) detecting said product and thereby the polynucleotide of claim 1 in 
the sample.

15. The method of claim 14, wherein the polynucleotide is an RNA molecule and the 
method further comprises reverse transcribing an annealed RNA molecule into a cDNA 
polynucleotide. 

16. A method for detecting the polypeptide of claim 10 in a sample, comprising: 

a) contacting the sample with a compound that binds to and forms a
complex with the polypeptide under conditions and for a period sufficient to form the 
complex; and 

b) detecting formation of the complex, so that if a complex formation 
is detected, the polypeptide of claim 10 is detected. 

17. A method for identifying a compound that binds to the polypeptide of claim 10, 
comprising: 

a) contacting the compound with the polypeptide of claim 10 under 
conditions sufficient to form a polypeptide/compound complex; and 

b) detecting the complex, so that if the polypeptide/compound 
complex is detected, a compound that binds to the polypeptide of claim 10 is identified. 

18. A method for identifying a compound that binds to the polypeptide of claim 10, 
comprising:

a) contacting the compound with the polypeptide of claim 10, in a 
cell, under conditions sufficient to form a polypeptide/compound complex, wherein the 
complex drives expression of a reporter gene sequence in the cell; and 

b) detecting the complex by detecting reporter gene sequence 
expression, so that if the polypeptide/compound complex is detected, a compound that 
binds to the polypeptide of claim 10 is identified. 

19. A method of producing the polypeptide of claim 10, comprising:

a) culturing a host cell comprising a polynucleotide sequence selected 
from the group consisting of a polynucleotide sequence of SEQ ID NO:1-1786 and 
3573-5358, a mature protein coding portion of SEQ ID NO:1-1786 and 3573-5358, an active 
domain of SEQ ID NO:1-1786 and 3573-5358, complementary sequences thereof and a 
polynucleotide sequence hybridizing under stringent conditions to SEQ ID NO:1-1786 
and 3573-5358, under conditions sufficient to express the polypeptide in said cell; and

b) isolating the polypeptide from the cell culture or cells of step (a). 

20. An isolated polypeptide comprising an amino acid sequence, selected from the 
group consisting of any one of the polypeptides SEQ ID NO:1787-3572 and 5359-7144,
the mature protein portion thereof, or the active domain thereof. 

21. The polypeptide of claim 20 wherein the polypeptide is provided on a polypeptide
array. 

22. A collection of polynucleotides, wherein the collection comprising the sequence 
information of at least one of SEQ ID NO:1-1786 and 3573-5358.

23. The collection of claim 22, wherein the collection is provided on a nucleic acid 
array. 

24. The collection of claim 23, wherein the array detects full-matches to any one of 
the polynucleotides in the collection. 

25. The collection of claim 23, wherein the array detects mismatches to any one of 
the polynucleotides in the collection. 

26. The collection of claim 22, wherein the collection is provided in a 
computer-readable format.

27. A method of treatment comprising administering to a mammalian subject in need 
thereof a therapeutic amount of a composition comprising a polypeptide of claim 10 or 20 
and a pharmaceutically acceptable carrier. 

28. A method of treatment comprising administering to a mammalian subject in need 
thereof a therapeutic amount of a composition comprising an antibody that specifically 
binds to a polypeptide of claim 10 or 20 and a pharmaceutically acceptable carrier. 
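
For illustration only, the "greater than about 90% sequence identity" criterion of claim 3 and the "complementary sequences" of claims 1 and 5 can be sketched in a few lines of Python. The claims do not specify how identity is calculated, so this sketch assumes two sequences that have already been aligned to equal length (gaps written as "-") and counts identical columns. The function names `percent_identity` and `reverse_complement` are hypothetical and do not come from the specification.

```python
def percent_identity(a: str, b: str, gap: str = "-") -> float:
    """Percent identity of two PRE-ALIGNED sequences of equal length.

    Assumption: identity = identical columns / compared columns * 100,
    where columns that are gaps in both sequences are skipped.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    columns = [
        (x, y)
        for x, y in zip(a.upper(), b.upper())
        if not (x == gap and y == gap)
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    # A column counts as a match only if both residues agree and are not gaps.
    matches = sum(1 for x, y in columns if x == y and x != gap)
    return 100.0 * matches / len(columns)


def reverse_complement(dna: str) -> str:
    """Reverse complement of a DNA string over the alphabet A, C, G, T, N."""
    complement = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}
    return "".join(complement[base] for base in reversed(dna.upper()))


if __name__ == "__main__":
    # Toy sequences, not taken from the sequence listing.
    print(percent_identity("ACGTACGTAC", "ACGTACGTAT"))
    print(reverse_complement("ATGC"))
```

In practice the alignment step and the exact identity definition (for example, how gaps are scored) would follow whatever method the specification adopts; the sketch only shows the final counting step.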